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Tropical protection 


After years of talk, the palm-oil industry is looking into adopting environmental standards. 
Such rules must be strong, and need to be implemented. 


ore than 100 major companies worldwide have made 
Meznnitnent to promote the use of environmentally sustain- 

able palm oil over the past few years. This is to their credit. 
Palm oil finds its way into everything from food and cosmetics to biofu- 
els, but the expansion of palm plantations has driven widespread defor- 
estation — as well as carbon emissions — in places such as Indonesia 
and Malaysia. To various degrees, companies that trade in palm oil have 
promised to halt the use of oil from newly cleared land, but implement- 
ing such goals is not easy. The latest attempt to create workable stand- 
ards comes from an industry consortium in consultation with a team of 
respected scientists. Their report is due out in December, anda draft is 
available for public comment until 31 July (see go.nature.com/rt7fue). 

This High Carbon Stock (HCS) Study was formally launched last 
year, when five leading palm-oil producers, including Sime Darby in 
Kuala Lumpur and IOI Corporation Berhad in Putrajaya, Malaysia, 
signed the Sustainable Palm Oil Manifesto. That document com- 
mits signatories to halting the expansion of palm plantations in 
dense forests where carbon emissions would be highest, but says that 
the palm-oil industry cannot focus solely on environmental issues. 
Environmentalists immediately accused the companies of seeking to 
undermine attempts to produce a stricter set of guidelines, and to delay 
obvious solutions with complicated science. 

There is some truth to this, but the merits of a given project do 
depend in part on the social and economic context in which it is 
situated. Decisions about land use are rarely made on the basis of 
environmental criteria alone, and many of the regions in which the 
plantations are located — or will be located — would see social and 
economic benefits from an orderly palm-oil industry. 

The question is where to draw the line. Most would agree that it does 
not make sense to tear down old-growth forests, which store a lot of 
carbon and are home to a diverse array of plants and animals. The same 
could probably be said for selectively logged forests, where only the big- 
gest and most valuable trees have been taken, which are still high in 
carbon and biodiversity. Everybody agrees that it would be wise to focus 
development on abandoned land that has already been fully cleared, 
and so has little carbon or biodiversity to speak of; in such areas, a palm 
plantation could increase the carbon stock, thereby alleviating global 
warming. In between, on degraded and heavily logged forests and in 
areas where forests are actively regrowing, there is more room for debate. 

The current draft of the HCS Study report seeks to create a frame- 
work for evaluating projects on the basis of both land type and socio- 
economic conditions. It proposes classifying land according to the 
state of forests: at the extremes, green represents the go-zone, such as 
already cleared land, and red the no-go zone, where primary forest 
remains. In the centre is ambiguous amber, a middle zone in which 
trade-offs are possible. If the social and economic benefits are high 
enough, perhaps a small hit to the climate is acceptable and could be 
offset by protecting additional land elsewhere. The first step in making 


such decisions is to get data on forest cover, and the study advocates 
mapping land with both high-resolution satellites and aircraft-based 

lasers to gather detailed measurements of forest structure. 
Confusingly, before the HCS Study launched, major environmental 
groups were engaging the industry in separate negotiations known as 
the High Carbon Stock Approach. Those talks intended to create a 
more conservative set of guidelines that often default to the red no-go 
zone when it comes to development. The HCS 


“The industry Study consciously goes in the other direc- 
mustfindaway tion, acknowledging that there may be cases 
topromoteboth — inwhich natural forests could be converted 
environmental to plantations in the name of alleviating 
protection poverty. “This is the essence of the ‘quid pro 
and social quo’ explored in this Study,’ the authors write. 


Ultimately, the industry must to find a way 
to promote both environmental protection 
and social well-being. Finding the right formula will not be easy, but it 
is a sign of progress that all sides are seeking a solution. In theory, this is 
the duty of government, but governments across the tropics have had a 
hard time controlling rampant development that has left many citizens 
behind. It would be a step in the right direction for environmentalists, 
scientists and businesses to agree on a set of meaningful standards. Then 
it would be a matter of ensuring that companies keep their word. m 


well-being.” 


Secret service 


Government labs should be subject to the same 
transparent oversight as academic facilities. 


security agencies can border on the ridiculous. US government 

paranoia over terrorism led to the generally despised — and 
questionably effective — airport rituals of prohibiting bottles that con- 
tain more than 100 millilitres of most liquids and subjecting all passen- 
gers to radiation in a virtual strip search. Public panic led to similarly 
overblown US responses to the 2014 Ebola outbreak, including the 
forced quarantine of people who were never exposed to the virus and 
had no chance of causing an epidemic (see page 502). 

How, then, was the US Department of Defense (DOD) able this 
year to send live anthrax spores across at least seven international bor- 
ders and to at least 183 labs without the authorities noticing? If there 
is anywhere that paranoid officials should want to monitor when it 
comes to anthrax, it is the DOD. After all, the DOD works with more 
anthrax than any other institution, and the only known bioterror 
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attack using anthrax spores as a weapon originated ata DOD lab. 

Oversight systems seem to have been watching everything except the 
most likely source ofa threat. 

When this year’s failure came to light, the DOD immediately began 
a 30-day investigation of itself. Its 38-page conclusion, released to the 
public last week, blamed no one in particular (see go.nature.com/ltcn6f). 
The military determined that the radiation procedure being used at the 
lab — Dugway Proving Ground in Utah — to kill the spores was ineffec- 
tive. It emphasizes that no one was harmed, and that there is no proven 
method to kill the notoriously resilient spores. Both these things are true. 

What is still unclear, however, is why the procedure was not better 
tested. The US Centers for Disease Control and Prevention (CDC) 
does not have particular standards for inactivation protocols. But if it 
did, Dugway’s protocol surely would not meet them: the lab had never 
optimized the procedure, and the base’s own records showed that the 
process failed once in every five attempts. Furthermore, neither the 
sending nor the receiving labs had done enough to verify that the sam- 
ples were dead. Dugway, for instance, tested only 5% of each sample 
for viability, which would not have detected alow concentration of live 
spores. In a twist of irony, DOD scientist Bruce Ivins, who was allegedly 
responsible for the 2001 anthrax attacks, had suggested that half ofa 
sample should be screened to rule out viability. 

Dugway has been in hot water before. An investigation by the news 
outlet USA Today found that the CDC had reprimanded the facility 
eight years ago for using a different experimental protocol to inactivate 
anthrax spores and then shipping them even when tests showed that 
they were still alive. According to USA Today, Dugway was let off with 


a warning, and the incident was not included in the DOD’s annual 
report to Congress. 

Academic labs could be justifiably rankled at the amount of money 
and time they have to spend complying with regulations on less danger- 
ous pathogens and harmless amounts of radiation. A university that 
flouts CDC regulations would probably be subject to harsh penalties. 
But US law allows government labs to maintain secrecy around their 

procedures and the results of investigations 


“Tt should not into their biosafety mishaps, of which there 
be left up to seem to be many. 

the media to That could soon change. On 28 July, both 
discover serious the DOD and the CDC were hauled before 
accidents at a congressional committee that is demand- 


ing answers and a new probe into the latest 
incident. The committee has also called for 
the agencies to produce a list of the labs that are authorized to work 
with anthrax and other bioterror agents, and for details of biosafety 
violations. Earlier this month, the CDC announced that it is beginning 
a 90-day review of its biosafety procedures for federal research labs that 
work with dangerous pathogens. 

It should not be left up to the media to discover serious accidents 
at the agencies charged with protecting people from bioterrorism. To 
be clear, the research they perform on anthrax and other pathogens is 
essential for biosecurity. Incompetent oversight combined with a cul- 
ture of secrecy could threaten that work. And, given the overabundance 
of caution applied elsewhere, there should be some spare to deploy at 
the government labs at which it is most needed. m 


agencies.” 


Realistic risks 


The communication of risk in disease outbreaks 
is too often neglected; that must change. 


South Korean hospitals is effectively over, with no new cases 

since 2 July. Since it began on 11 May, a total of just 186 people 
were infected by the coronavirus, 36 of whom have died. The episode 
was tragic, but its economic and social impact was disproportionate. If 
the world is to respond effectively to infectious-disease outbreaks, then 
the authorities, the media and communities must pay more attention 
to risk communication. 

The only people at real risk of infection in South Korea were those 
who had shared a hospital area with someone who had MERS. Yet at 
the outbreak’s peak in early June, thousands of schools were needlessly 
closed and public events were cancelled. Tourist numbers dropped by 
41% compared with the same month last year: a US$10-billion loss 
that is expected to knock 0.1% off the country’s gross domestic product 
growth this year. The only winners were those selling the ubiquitous 
and superfluous face masks. 

One important question — and lesson to learn — is how the authori- 
ties failed both to convey the limited threat posed by MERS, and to 
persuade the media and public that they had the outbreak under control. 

Public trust in Korean officials was already low after a perceived 
bungled response to the sinking of the ferry MV Sewol last year, which 
killed more than 300 people, many of them secondary-school pupils. 
When MERS struck, the authorities foolishly declined to identify the 
affected hospitals publicly, allowing rumours — amplified by social 
media — to fill the space. This faltering start was unfortunate because 
the government did get its act together soon after. Its transparency 
in reporting new cases became exemplary, as did its public-health 
response — including the massive task of tracing and isolating the 
more than 16,500 people who had been in contact with infected 


Ts outbreak of Middle East respiratory syndrome (MERS) in 
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patients. The last contact was released from isolation this week. 

Disease outbreaks are frightening, and overreaction to a virus that 
can kill is an understandable human response. It is one that needs to 
be understood and managed, not dismissed as irrational. 

This puts great responsibility on the shoulders of the press and politi- 
cians, and often we see that some are not up to the job. When a handful 
of Ebola cases occurred on US soil last year, it sparked what President 
Barack Obama has described as “hysteria. Many media reports were 
balanced and excellent, but too much of the reporting was excessive 
and sensationalist. Complicating matters further, right-wing political 
opportunists and pundits used the Ebola cases to take partisan shots at 
the Obama administration. Combined with the 24/7 news cycle, and 
again amplified by social media, coverage of what was a legitimate news 
story became a shambolic and sorry mess, utterly detached from the 
reality — that the United States faced no threat of an Ebola epidemic. 

This had real consequences. Several politicians, including Chris 
Christie, the governor of New Jersey, implemented unnecessary and 
counterproductive measures, such as forced quarantine of US health- 
care workers returning from West Africa. Republican presidential 
hopeful Donald Trump showed a troubling grasp of the issue, and 
called for US borders to be sealed to those arriving from the region, 
including health-care workers. If this was the US response to a non- 
existent disease threat, what would its reaction be to a serious epidemic 
threat? Some outbreak-response officials think that the trend towards 
instantaneous news, compounded by social media, could interfere with 
effective public-health interventions and result in societal chaos. 

Overreactions to outbreaks that pose no large threat can distract from 
those that do, and the priority is to eliminate the threats at source. Ebola 
must be stamped out in West Africa, and MERS must not be allowed 
to fester in the Middle East, where it is endemic in camels. Researchers 
need to identify and close the routes by which the MERS virus spreads 
to people. Social-science researchers can help to unravel complex factors 
affecting public reactions to outbreaks, and how 
authorities can build trust, so that risks can be 
better communicated. They might also ask how 
European countries managed to respond coolly 
to the arrival of both MERS and Ebola cases. = 
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n recent weeks, we have learned that Pope Francis enticed Cuban 
[rec Raul Castro to consider a return to Catholicism, and has 

ended a dispute involving US nuns that will allow them to return 
to serving the poor free from the suspicion of heresy. 

Perhaps most surprisingly, at least to this Protestant ecologist embed- 
ded for 30 years ina Roman Catholic university, the Pope has suggested 
that humans should not breed “like rabbits”, despite his church's con- 
tinued prohibition of birth control. 

Pope Francis is clearly a man on a mission to shake things up. 
Could the world’s leading Catholic help to bridge the divide between 
science and the Protestant views that dominate the religious ‘anti- 
science movement? I think that he could. 

In his recent encyclical on humans and the environment, Pope 
Francis described environmental degradation 
with great scientific accuracy, and he linked it to 
economic exploitation and the plight of the poor. 
This is a challenge to many conservative Prot- 
estants who believe that humans, because they 
are made in God's image, have a divine right to 
exploit the natural world. 

The Pope's argument is a powerful one, and 
addresses those with power, especially in the 
United States. The views of notorious climate- 
sceptic Senator James Inhofe (Republican, 
Oklahoma) on the threat of global warming, for 
example, are underpinned by his strong Protestant 
conviction that God created natural resources for 
humans, and that we are arrogant in thinking that 
we can affect God’s plan for the Earth. By contrast, 
many environmentalists argue that humankind 
should protect nature for its intrinsic value, with 
little apparent regard for the importance of its use for human welfare. 

By framing protection of the environment as protecting human 
welfare, the Pope has linked the interests of groups that are often at 
odds. He offers some middle ground on which both sides of this polar- 
ized debate can meet and work towards a mutually desirable future. 

Such a compromise between the extremes of the religious and 
environmentalist positions could also help to defuse other sources of 
tension between faith and science. To many people, the two cannot be 
reconciled — so much so that when I tell people I am a biologist, believe 
in evolution and work on environmental issues, I am often told that I 
cannot be a Christian. Sadly, this is the message in many conservative 
Protestant churches: choose between science and faith. 

The same polarization is urged by many prominent popularizers 
of science and the ‘New Atheists’ — with 


Richard Dawkins as their figurehead. Is it NATURE.COM 
so surprising, then, that in the United States _ Discuss this article 
especially, atheism is over-represented among __ online at: 


scientists, and that science-faith polarization is _go.nature.com/rgzfié 


ISITSO 
SURPRISING 


THAT, IN THE UNITED 
STATES ESPECIALLY, 


ATHEISM 


IS OVER- 
REPRESENTED AMONG 


SCIENTISTS? 


Faith and science can find 
common ground 


Pope Francis has found a meeting place for those with extreme religious and 
environmentalist stances, says David M. Lodge. 


increasingly reflected in political and cultural discourse? 

For example, nothing in the official teaching of Catholicism opposes 
evolution. Creationism is a recent Protestant invention, based on 
extreme, literal interpretations of the first three chapters of the Bible's 
book of Genesis. Catholicism relies more on an interpretation of the 
scriptures that is rooted in a tradition of reason informing faith. Yet 
when I ask my biology undergraduates whether they feel a conflict 
between their faith and evolution, about half of every class — 85% of 
whom are Catholic — say yes. 

The students respond in this way because evolution, alongside issues 
such as climate change, stem cells, abortion and gay marriage, has been 
conscripted into the culture wars, with science increasingly suffering 
collateral damage. And as the culture wars have forced people to choose 
sides, respect for science is now divided along 
political lines too, with huge influence on policy. 

US environmental-protection policies, which 
began as bipartisan efforts to protect human and 
environmental health, have become destructively 
partisan. It was the Republican president Rich- 
ard Nixon who in 1972 signed the Clean Water 
Act that brought Lake Erie back from the dead, 
restoring one of the most economically valuable 
freshwater fisheries in the world. Nowadays, 
efforts to improve water and air quality are too 
often supported by Democrats and opposed by 
Republicans — on the grounds that environ- 
mental protection harms human welfare, and 
that because the world is temporary, long-term 
protection is unnecessary. They dismiss scien- 
tists, who increasingly quantify the great extent 
to which environmental protection benefits 
humans, as just another special-interest group. 

As a Protestant scientist, I am distressed to see my faith twisted into 
support for such short-sighted extremism. Martin Luther, the great 
Protestant reformer, once said: “Even if 1 knew that tomorrow the world 
would go to pieces, I would still plant my apple tree.” Like Pope Francis, 
he understood the importance of loving and tending the gift of creation. 

If Pope Francis can persuade the communist Raul Castro to recon- 
sider Catholicism, I can hope that the Pope’s respect for the scientific 
consensus on climate change will foster a more constructive dialogue 
between the communities of science, faith and policymakers. His rec- 
ognition that the economy and the environment are inextricably linked, 
especially for the desperately poor, builds on a foundation that is older 
and deeper than the recent US culture wars. = 


David M. Lodge is director of the University of Notre Dame 
Environmental Change Initiative, Indiana, and editor (with historian 
Christopher Hamlin) of Religion and the New Ecology (2006). 
e-mail: dlodge@nd.edu 
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Selections from the 
scientific literature 


RESEARCH HIGHLIGHTS 


Qubit controlina 
3D matrix 


Qubits — quantum bits which 
store and process information 
in quantum computers — can 
be controlled individually 
ina 3D structure without 
disturbing nearby atoms. 
Neutral atoms show 
promise as qubits when they 
are cooled and trapped by 
light, but manipulating one 
atom without disturbing 
its neighbours is difficult. 
David Weiss and his team at 
Pennsylvania State University 
in University Park controlled a 
single atom ina5x5 x5 array 
of trapped caesium atoms by 
firing two beams of circularly 
polarized light so that they 
intersected at the target atom. 
This caused the energy levels 
of electrons in the atom to 
shift, allowing the researchers 
to change its quantum state by 
hitting it with microwaves. 
The method should make 
it easier to scale up quantum 
computers that use this kind of 
qubit, the researchers say. 
Phys. Rev. Lett. 115,043003 (2015) 


Stomach tissue 
made in adish 


Mouse embryonic stem cells 
can develop into 3D ‘mini 
stomachs’ in the lab. 


How boa constrictors really kill 


Animals that have been squeezed to death by 
snakes probably die of circulatory arrest rather 
than of suffocation as was thought. 

Scott Boback at Dickinson College in Carlisle, 
Pennsylvania, and his team anaesthetized 
rats and implanted probes and catheters to 
measure their heart rate, blood pressure and 
blood chemistry as the animals were being 
squeezed by a boa (Boa constrictor; pictured). 
Constriction lasted an average of 6.5 minutes, 


Researchers have previously 
used stem cells to make 
parts of the stomach, but not 
the whole organ. Taka-aki 
Noguchi, Akira Kurisaki at the 
University of Tsukuba in Japan 
and their team made stomach 
tissue — including the main 
food-containing part — by 
adding several key growth 
factors to the stem-cell culture 
after six days. These turned on 
expression of the Barx1 gene, 
which is essential for stomach 
development. After about 
60 days, the cells developed 
into stomach tissue (pictured) 
that contained several 
specialized types of stomach 
cell. The mini stomach also 
secreted a digestive hormone 
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and acid, and had a similar 
gene-expression profile to that 
of adult stomach tissue. 

This system could be used 
to study stomach diseases, the 
authors say. 
Nature Cell Biol. http://doi.org/ 
6gf (2015) 
Fora related News Feature on 
organoids, see page 520. 


Marijuana’s good 
without the bad 


Mice treated with marijuana’s 
active component, THC, 
along with other key 
molecules, can experience 

its pain-relieving benefits 
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but the rats’ peripheral blood pressure dropped 

by halfas early as 6 seconds in. By the end of 

constriction, the rats’ blood chemistry showed 

signs of system-wide circulatory problems 

and there was evidence that the heart had 

undergone significant electrical dysfunction. 
The authors suggest that snakes may release 

their prey only after detecting that it has 

experienced irreversible heart failure. 

J. Exp. Biol. 218, 2279-2288 (2015) 


without the usual memory 
impairments. 

THC binds to the CB1 
cannabinoid receptor in 
the brain, causing negative 
effects such as poor memory 
and anxiety. However, it also 
affects behaviours that are 
regulated by the serotonin 2A 
receptor. Patricia Robledo 
at Pompeu Fabra University 
in Barcelona, Spain, and her 
colleagues gave THC to mice 
lacking the serotonin receptor, 
and found that the animals 
did not show signs of memory 
loss but could still tolerate 
painful stimuli. Studies of 
cells expressing both types 
of receptors revealed that 
the receptors physically 


SCOTT M. BOBACK 


TAKA-AKI K. NOGUCHI ET AL./NATURE CELL BIOL. 


SCIENCE/AAAS 


interact. Mice that were given 
THC and peptides that stop 
the receptors interacting 

in the brain did not show 
memory problems, but still 
experienced pain relief. 

The findings suggest a way 
to minimize the negative 
effects of medical marijuana, 
the authors say. 

PLoS Biol. 13, 1002194 (2015) 


Telescope spies 
early galaxy’s birth 


Astronomers have spotted 
the glow from one of the most 
distant galaxies ever seen in 
the early Universe. 

Roberto Maiolino at the 
University of Cambridge, UK, 
and his colleagues used the 
high-resolution Atacama Large 
Millimeter/submillimeter 
Array (ALMA) telescope in 
Chile to observe three faint 
galaxies that began forming less 
than one billion years after the 
Big Bang. In one galaxy they 
detected clouds of cold ionized 
carbon that was shifted away 
from the bright, star-forming 
centre. This matches models of 
early galaxy formation, which 
predict that active young stars 
disperse such clouds. 

The data will help to test 
theories about how the 
Universe's first stars and 
galaxies formed, the team says. 
Mon. Not. R. Astron. Soc. 452, 
54-68 (2015) 


Acritical period 
for brain health 


Disrupting a signalling 
molecule in week-old mice 
prevents their neurons from 
altering connections with 
other neurons in adulthood. 

Cognitive problems can 
result from an inability to 
strengthen or weaken these 
connections, called synapses, 
in the brain in response to 
external triggers — a process 
known as plasticity. Neil 
Hardingham of Cardiff 
University, UK, and his 
colleagues blocked signalling 
by the DISC1 protein in 


7-day-old mice for 6-48 hours. 
When the animals reached 
adulthood, their synapses failed 
to adapt in response to signals 
generated by stimulating one of 
their whiskers. 

In humans, changes to 
DISCI1 signals during this 
critical period soon after birth 
could underlie psychiatric 
symptoms that emerge during 
adolescence, the authors 
suggest. 

Science 349, 424-427 (2015) 


Microbe war waged 
with biofilms 


Sticky films of microbes form 
as the result of competition, 
not cooperation, between 
bacterial strains. 

Scientists had thought that 
biofilms form cooperatively, 
with multiple strains of 
bacteria growing to protect 
one another. But when Kevin 
Foster of the University 
of Oxford, UK, and his 
colleagues mixed two different 
strains of the opportunistic 
pathogen Pseudomonas 
aeruginosa, they found that 
the strains competed with each 
other, with one producing 
antibiotics called pyocins to 
kill its opponent. This mixture 
produced greater amounts of 
biofilm than did the individual 
strains, and introducing other 
antibiotics into the mix further 
stimulated biofilm formation. 

The results suggest that the 
clinical use of antibiotics could 
be causing bacteria to make 
more biofilms, making the 
microbes harder to stamp out. 
PLoS Biol. 13, e1002191 (2015) 


| ENGINEERING 
Origami for thick 
materials 


Origami patterns designed 
for thin pieces of paper can be 
extended to thicker materials 
as well. 

Previous attempts to 
fold 3D materials required 
adding layers of material or 
changing their geometry. To 
avoid this, Zhong You at the 
University of Oxford, UK, 


RESEARCH HIGHLIGHTS MiiiSaiaa¢ 


SOCIAL SELECTION 


Popular topics 
on social media 


Science reporting flaws exposed 


Researchers often complain about inaccurate science 
stories in the popular press, but few air their grievances ina 
journal. Samuel Mehr, a PhD student at Harvard University 
in Cambridge, Massachusetts, discussed in a Frontiers 
in Psychology article some examples of media missteps 
from his own field — the effects of music on cognition. 
The opinion piece gained widespread attention online. 
“A very nice review of how media communicates science 
badly,’ tweeted Arseny Khakhalin, a neuroscientist at Bard 
College in Annandale-on-Hudson, New York. Seeing room 
for blame on both sides, Terry Hébert, a pharmacologist 

at McGill University in Montreal, 


> NATURE.COM 
For more on 

popular papers: 
go.nature.com/awqdox 


and his colleagues developed 
a method of assembling thick 
materials so that the hinges 
where they meet move in 

a limited number of ways. 
The researchers showed. 

how carefully choosing the 
placement of the hinges and 
creases allows the structures 
to move and fold (pictured) 
in identical ways to origami 
patterns that use 2D 
materials. 

The method could 
eventually improve the 
construction of foldable 
structures such as solar panels 
or aircraft wings, the authors 
report. 

Science 349, 396-400 (2015) 


Reroute bile for 
better metabolism 


Altering the flow of bile in the 
mouse gut achieves the same 
weight-loss benefits as gastric- 
bypass surgeries. 

Ina gastric bypass, the 
stomach pouch is made 
smaller and connected to the 


Canada, posted on Facebook: 
“Journalists and scientists should both 
strive for accuracy in their claims.” 
Front. Psychol. 6, 00988 (2015) 


middle of the small intestine. 


Naji Abumrad at Vanderbilt 
University Medical Center 

in Nashville, Tennessee, and 
his team tested whether the 
success of such procedures can 
be replicated by redirecting 
bile fluids, which break down 
dietary fat in the upper gut 

to enable fat absorption. 
Rerouting the bile duct to the 
ileum (the lower part of the 
small intestine) in obese mice 
resulted in weight loss and 
other metabolic improvements 
that were similar to those 
seen after a gastric bypass. 
The bile diversion reduced fat 
absorption and shifted 

the composition of gut 
microbes to be similar to that 
of lean mice. 

Redirecting the bile duct in 
humans would be simpler than 
gastric bypass, but long-term 
safety studies are needed, the 
researchers say. 

Nature Commun. 6, 7715 (2015) 
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SEVEN DAYS nscsns 


Alzheimer’s drugs 


Alzheimer’s patients enrolled 
in clinical trials of two 
antibody drugs showed 

minor improvements in their 
symptoms, two US drug 
companies reported on 22 July 
at the Alzheimer’s Association 
International Conference in 
Washington DC. Eli Lilly’s 
drug solanezumab slowed the 
rate of cognitive decline by 
30% over two years. The drugs 
target amyloid-f protein in 

the brain, which accumulates 
in people with Alzheimer’s. A 
small trial of Biogen’s similar 
antibody, aducanumab, showed 
that high doses of the drug, but 
not moderate doses, removed 
amyloid and slowed symptoms. 
Both drugs have entered phase 
III trials. See page 509 for more. 


Nearly Earth 2.0 
NASAs Kepler spacecraft has 
found an extrasolar planet that 
is the closest thing yet to a true 
Earth analogue, the agency 
announced on 23 July. The 
planet, named Kepler-452b, 
orbits a bright star 430 parsecs 
(1,402 light years) away from 
Earth, within its star’s habitable 
zone, where temperatures are 
right for liquid water to exist on 
the planet's surface. Kepler- 
452b is 60% larger than Earth 
and may be rocky. If it is, it will 
be the first terrestrial planet 
discovered within the habitable 
zone of a Sun-like star. (Similar 
planets have been found that 
orbit stars cooler and dimmer 
than the Sun.) See page 511 for 
more. 


| __BUSINESS 
Lifespan lineage 


A genetic-ancestry company 
that boasts more than one 
million paying customers has 
teamed up with a Google- 
backed biotechnology firm to 
uncover lifespan-lengthening 
DNA. On 21 July, the 


A hazy halo around Pluto 


The New Horizons spacecraft has discovered hazes in 
Pluto's frigid atmosphere, NASA announced on 24 July. 
Fresh discoveries from the craft’s historic fly-by on 14 July 
include possibly the first evidence that Pluto’s atmosphere 

is beginning to freeze and fall as snow on the surface, which 
is what scientists had expected as its elliptical orbit takes it 
farther from the Sun. Powerful radio beams sent from Earth 
to the craft allowed researchers to measure the surprisingly 
low surface pressure of Pluto’s atmosphere as it bent the 
radio waves. Other findings include glaciers of nitrogen ice 
flowing from the Sputnik Planum plains. See go.nature.com/ 


bmmldk for more. 


genealogy firm Ancestry.com 
in Provo, Utah, announced 

a partnership between its 
subsidiary AncestryDNA 
and Calico, a research and 
development company 

in South San Francisco, 
California. Calico will use 
anonymized customer-genome 
data and ancestry records to 
search for genetic factors that 
contribute to longevity. 


Generic drug deal 
Jerusalem-based drug 
company Teva announced 
on 27 July that it would 
buy the generics arm ofa 
rival company for around 
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US$40.5 billion. Teva will 
pay $33.75 billion in cash 
and hand over roughly 
$6.75 billion in shares to 
Allergan, of Dublin, to 
acquire Allergan Generics. 
The companies say that 
they expect the deal to be 
completed early in 2016. 


Hot drugs approved 
Regulators have approved two 
members of a hotly pursued 
class of cholesterol drugs 
called PCSK9 inhibitors. On 
21 July, Biotechnology firm 
Amgen of Thousand Oaks, 
California, announced that 
the European Commission 
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had approved its drug, called 
evolocumab. Three days 
later, the US Food and Drug 
Administration approved 
Praluent (alirocumab), made 
by Sanofi US of Bridgewater, 
New Jersey, and Regeneron 
of Tarrytown, New York. 
Both drugs are approved for 
use in people with familial 
hypercholesterolaemia, a 
genetic condition that causes 
high cholesterol (see Nature 
496, 152-155; 2013). 


EVENTS 


Name that asteroid 
The Japan Aerospace 
Exploration Agency (JAXA) 
has invited the public to 
suggest names for the asteroid 
1999 JU3, which JAXA’s probe 
Hayabusa-2 will reach in 
2018. The spacecraft, which 
launched in December 2014, 
will collect samples from the 
rock and return to Earth in 
2020 (see Nature http://doi. 
org/6dg; 2014). Candidate 
names will be vetted by 

a panel of specialists, the 
asteroid’s discoverers and, 
ultimately, the International 
Astronomical Union. The 
contest, which opened on 

22 July, resembles NASA's 

call to name features on 
Pluto; submissions close on 
31 August (see go.nature.com/ 
rybeqw). 


Outbreak ending 
The South Korean outbreak 
of Middle East respiratory 
syndrome (MERS) is set 

to be declared as over. 
Beginning in hospitals in 
mid-May, the coronavirus 
outbreak was quickly brought 
under control, with case 
numbers peaking on 1 June. 
In total, 186 people were 
confirmed as infected, 36 of 
whom have died. No new 
cases have occurred since 

2 July. On 27 July, the last of 
16,693 contacts was released 
from isolation. The outbreak 
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can officially be declared over 
when no new cases have been 
detected for 28 days. 


Weapons warning 


Nearly 2,000 researchers and 
commentators released an 
open letter on 28 July calling 
for the United Nations to ban 
autonomous weapons. The 
letter warns that weapons 
using artificial intelligence 
(AI) may be only years away, 
and that their creation could 
lead to an “AI arms race” It was 
released at the International 
Joint Conference on Artificial 
Intelligence in Buenos Aires, 
Argentina, and signatories 
include more than 1,000 AI 
researchers, as well as physicist 
Stephen Hawking and inventor 
Elon Musk. 


Nigeria polio-free 
On 24 July, Nigeria marked 
one year since its last case of 
polio caused by wild virus. 

The West African nation will 
be removed from the list of 
polio-endemic countries once 
laboratory tests from across the 
country are complete. Pakistan 
and Afghanistan are now the 
only two countries where 
transmission of polio virus has 
never been halted. Africa could 
be certified polio-free as soon 
as August 2017, three years 
after the continent's last case, 
in Somalia, which was caused 
bya virus that spread from 
Nigeria. See go.nature.com/ 
uomxdr for more. 


TREND WATCH 


Ending fossil-fuel subsidies, 
which cost US$550 billion 
globally in 2011, could free up 
money to achieve universal 
access to basic services such 
as clean water, according to 
researchers at the Mercator 
Institute on Global Commons 
and Climate Change in Berlin 
(M. Jakob et al. Nature Clim. 


Change 5, 709-712; 2015). The 


team says that access to clean 
water could be achieved for 


$190 billion, with sanitation and 
electricity costing $370 billion 


and $430 billion, respectively. 


Pachauri out 


Rajendra Pachauri, the 

former chairman of the 
Intergovernmental Panel on 
Climate Change (IPCC), has 
been replaced as director- 
general of The Energy and 
Resources Institute (TERI) 

in New Delhi, India. The 
action by TERIT’s governing 
council, announced on 23 July, 
comes on the heels of sexual- 
harassment allegations that cost 
Pachauri (pictured) his post at 
the IPCC in February. He will 
be replaced by Ajay Mathur, 

an energy official in the Indian 
government. Ina press release, 
TERI stated that the search for 
Pachauri’s replacement began 
in September 2014. 


Genome head quits 
The revered leader of a leading 
genomics research centre in 
China has abruptly stepped 
down. Jun Wang is leaving 


SHARING SUBSIDIES 


his post as chief executive of 
BGI in Shenzhen, the institute 
announced on 17 July, and 
will pursue research into 
artificial intelligence. Wang 
has worked at BGI since it 
opened in 1999. He is credited 
with leading some of its major 
accomplishments, including 
being the first to sequence the 
genome of an Asian person, 
the giant panda and the 
human gut microbiota. He 
also led contributions to the 
Human Genome Project and 
the initiative to sequence the 
rice genome. See go.nature. 
com/nhdkjs for more. 


Pesticide ban lifted 


A UK government agency has 
used emergency rules to make 
controversial neonicotinoid 
insecticides available to some 
farmers, despite a European 
ban imposed on the chemicals 
in 2013. Neonicotinoids have 
been linked to declines in 

bee populations in numerous 
scientific studies. The UK 
Department for Environment, 
Food & Rural Affairs 
authorized limited emergency 
use of two pesticides on 22 July 
after a request by farmers. The 
National Farmers’ Union says 
that the pesticides are needed 
to protect around 300 square 
kilometres of oilseed rape in 
England from cabbage stem 
flea beetles. See go.nature. 
com/ytp9q6 for more. 


The portion of national fossil-fuel subsidies that would need to be 
redistributed to provide clean water by 2030 is highest in China 


and in parts of Africa. 
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SEVEN DAYS | THIS WEEK | 


2-6 AUGUST 

The 27th International 
Congress for 
Conservation 

Biology and the 4th 
European Congress for 
Conservation Biology, 
join in Montpellier, 
France, to discuss ways 
to tackle conservation. 
go.nature.com/rsu6x3 


4-6 AUGUST 
Planetary scientists 
gather in Arcadia, 
California, to argue the 
case for their favourite 
Martian landing site. 
Around 30 candidate 
locations are in the 
running to be chosen for 
NASAs 2020 Mars rover. 
go.nature.com/x4wh7c 


3-14 AUGUST 
Honolulu, Hawaii, 

is awash with 3,500 
astronomers, when 
the International 
Astronomical Union 
holds its general 
assembly. 
go.nature.com/4uj4ia 


Malaria vaccine 


The world’s first vaccine 
against malaria can now 

be used routinely in 

children — but only in 
countries where the disease 
is endemic. On 24 July, 

the European Medicines 
Agency in London declared 
Mosquirix safe for use 

and moderately effective 
against the malaria parasite 
Plasmodium falciparum, in 
combination with established 
protective measures such as 
bed nets. The World Health 
Organization must now 
formally agree to recommend 
its use in children. Mosquirix 
has been developed by 
UK-based drug firm 
GlaxoSmithKline, and is also 
protective against hepatitis B. 


> NATURE.COM 
For daily news updates see: 
www.nature.com/news 


30 JULY 2015 | VOL 523 | NATURE | 507 


© 2015 Macmillan Publishers Limited. All rights reserved 


RAUL ARBOLEDA/AFP/GETTY 


NEWSIN FOCUS 


ship to probe Greenland’s 


CLIMATE CHANGE NASA sends 
glaciers p.510 


looms for US science 


BUDGET DEADLINE Frustration 
agencies p.513 


COSMOLOGY ‘Half-pipe’ 
telescope prepares to 
tackle dark energy p.514 


SS STEM CELLS Mini kidneys, 
® livers, brains and guts are 


part of ‘organoid’ boom p.520 


Alzheimer’s disease is marked by cognitive decline and the accumulation of proteins in the brain. 


DRUG DEVELOPMENT 


Alzheimer’s drugs 
show progress 


Protein-targeting antibodies succeed after many failures. 


BY SARA REARDON 


disease have been frustrated on two 

fronts. They have struggled to understand 
whether the amyloid-f protein that accumulates 
in sufferers brains is a driver of the disease, or 
just a symptom. And without a clear under- 
standing of the condition’s cause, they have 
searched fruitlessly for effective treatments. 


| bs years, scientists studying Alzheimer’s 


The latest results from clinical trials of two 
antibody drugs could provide a path forward. 
For the first time, such drugs — which target 
amyloid — seem to have slowed the progres- 
sion of the disease. The findings, released on 
22 July at the Alzheimer’s Association Interna- 
tional Conference, support the idea that amy- 
loid deposits cause the mental deterioration 
seen in people with Alzheimer's. 

“We're creeping in the right direction,” says 
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Samuel Gandy, a neurobiologist at Mount Sinai 
School of Medicine in New York. “A lot of the 
euphoria is because things were so negative for 
so long Still, many researchers doubt whether 
the minor improvements reported last week 
will hold up in larger trials. 

Pharmaceutical firm Eli Lilly of Indianapolis, 
Indiana, says that in a trial with 440 participants, 
its drug solanezumab seemed to slow the cogni- 
tive decline of people with mild Alzheimer’s by 
about 30%. Over 18 months, people in the treat- 
ment group experienced a loss of mental acuity 
equivalent to the deterioration experienced in 
just 12 months by those ina placebo group with 
Alzheimer’s disease of similar severity. 

Lilly snatched this small victory from the 
jaws of defeat. In 2012, the company reported 
no difference between patients who had taken 
solanezumab for 18 months and those who 
had received a placebo. But when the com- 
pany reanalysed the trial data, it found a slight 
improvement in participants whose symptoms 
were mild when the trial began. Lilly continued 
the test for six months and began giving solan- 
ezumab to the 440-member control group, 
whose disease was by then more advanced. 

The latest results show that cognitive decline 
in the ‘late start’ group slowed to match the rate 
seen in the 440 people who had been treated 
for the entire study. This suggests that solane- 
zumab targeted the root of Alzheimer’s disease. 

Drug-maker Biogen, of Washington DC, 
presented results that show a moderate dose 
of its drug aducanumab reduced amyloid 
build-up in 23 people, but did not have statis- 
tically significant clinical benefits. In March, 
the company reported that 27 people who 
received high doses of aducanumab for one 
year showed significantly less cognitive decline 
than people who received a placebo, and had 
less amyloid in their brains. 

Many experts are greeting these results 
with tempered excitement, given the relatively 
small size of the clinical trials. But Eric Siem- 
ers, an Alzheimer’s researcher at Lilly, is more 
optimistic. “It's surprising to me that [solan- 
ezumab] worked so well,” he says. “There’s a 
lot of promise to slow progression.” 

Lilly launched a larger phase II trial of 
solanezumab in 2013, enrolling 2,100 people 
with mild symptoms and amyloid deposits in 
their brains. The study will end in October 2016. 
And last December, Biogen said that it would 
launch a phase III trial with 2,700 participants 
that would run for 18 months. > 
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> Lon Schneider, an Alzheimer’s 
researcher at the University of Southern 
California in Los Angeles, questions the 
decision to start large trials before the 
drugs, and the amyloid hypothesis, have 
been well validated. “Why are there so 
many antibodies when none so far have 
proven efficacy?” he asks, noting that 
behavioural interventions, such as diet and 
exercise, have been shown to slow Alzhei- 
mer’s as much as any drug (A. M. Clarfeld 
and T. Dwolatzky JAMA Intern. Med. 173, 
901-902; 2013). 

But not everyone agrees. “This is the 
time to be bold,’ says Randall Bateman, a 
neurologist at Washington University in 
St. Louis, Missouri. “It seems to me the cost 
of delay from a human-suffering standpoint 
is much more expensive than the cost of 
moving forward” 

Bateman is leading a trial that is testing 
solanezumab and ganetenerumab — devel- 
oped by Roche of Basel, Switzerland — in 
160 people between 18 and 80 years old 
who have a genetic risk of Alzheimer’s, but 
no symptoms. It is one of several efforts 
attempting to determine whether the disease 
can be prevented by destroying amyloid pro- 
tein before the brain is damaged. That harm 
occurs over decades (R. J. Bateman et al. 
N. Engl. J. Med. 367, 895-804; 2012), and 
many Alzheimer’s researchers suspect that 
antibody-drug trials have failed because they 
have treated people too late. 

This hypothesis is supported by Lilly’s 
finding that only people with mild dis- 
ease benefit from solanezumab. The latest 
results also demonstrate for the first time 
in humans that slowing amyloid deposi- 
tion can slow down cognitive decline, says 
Eric Reiman, executive director of the 
Banner Alzheimer’s Institute in Phoenix, 
Arizona. 

That is important because the US Food 
and Drug Administration has said that it 
will not approve drugs that block amyloid 
deposits without sufficient evidence of a 
clinical benefit. If one drug company can 
prove the cause and effect between amyloid 
accumulation and Alzheimer’s progression, 
all companies will benefit, says Reiman, 
who is leading a trial of crenezumab, a 
Roche drug that has also failed previously 
in large trials. 

Ifsuch drugs falter in larger preventative 
trials, that would be a setback for Alzhei- 
mer’s research in general, says Gandy. “The 
main concern is that the pipeline behind 
amyloid-reducing agents is really pretty 
spare,’ he says. However, at least three com- 
panies are developing treatments — some 
of which are antibody drugs — that target 
a different protein, tau, which destroys 
neurons in advanced Alzheimer’s disease. m 
See go.nature.com/lathpu for a longer 
version of this story. 
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GLACIOLOGY 


NASA launches 
mission to Greenland 


Ship and planes will probe water —ice interface in fjords. 


BY JEFF TOLLEFSON 


en the retired fishing trawler 
MV Cape Race sets off along 
Greenland’s west coast this week, it 


will start hauling ina scientific catch that prom- 
ises to improve projections of how the ice-cov- 
ered island will fare in a warming world. The 
ship’s cruise is the initial phase ofa six-year air 
and sea campaign to probe interactions between 
Greenland’s glaciers and the deep, narrow fjords 
where they come to an end. 

Called Oceans Melting Greenland (OMG), 
the US$30-million NASA project will help 
scientists to predict the future of the Green- 
land ice sheet, which holds enough water to 
boost sea levels by around 6 metres and already 
seems to be melting more rapidly in response 
to increasing air temperatures. But it is not clear 
how much the oceans affect the rate of melt- 
ing along the island’s edges, which depends on 
poorly known variables such as how warm, 
saline water interacts with the glaciers. 

“Tt should be a powerful constraint on our 
knowledge and ability to model ice loss there,” 
says principal investigator Joshua Willis, an 
oceanographer at NASAs Jet Propulsion Labo- 
ratory in Pasadena, California. 

When simulating glacier dynamics, current 
global climate models consider only ice’s 
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interactions with the atmosphere, says William 
Lipscomb, an ice modeller at Los Alamos 
National Laboratory in New Mexico. He is 
working to incorporate ice-ocean interactions 
around Antarctica into a climate model being 
developed by the US Department of Energy. But 
in Greenland, the intricately carved coastline 
makes this much more difficult. The depart- 
ment plans to give researchers at the Naval 
Postgraduate School in Monterey, California, 
$466,000 over 2 years to build a detailed model 
that will link the land ice and oceans around 
Greenland. OMG data will help to validate that 
model, says project leader Frank Giraldo. 

Work by OMG participant Eric Rignot, a 
glaciologist at the University of California, 
Irvine, underscores the importance of detailed 
data (E. Rignot et al. Geophys. Res. Lett. http:// 
doi.org/6dn; 2015). Using sonar data from one 
part of western Greenland, Rignot’s team found 
that existing maps underestimate the depth of 
three fjords by several hundred metres. It also 
found that glaciers flowing into all three fjords 
extended deeper than was thought, far enough 
below fresh surface waters to reach a warm, salty 
layer flowing up from the Atlantic Ocean that 
could accelerate melting and contribute more 
to sea-level rise than had been believed. 

“With OMG, we are going to reveal the 
depth of these fjords,’ says Rignot. 


MARIA STENZEL/UCI 


The programme will also provide valuable 
information on the physical characteristics of 
glacier ice. Last December, geophysicist Beata 
Csatho of the University at Buffalo in New York 
and her colleagues reported using surface- 
elevation data to estimate how much ice mass 
Greenland had lost between 1993 and 2012 
(B. M. Csatho et al. Proc. Natl Acad. Sci. USA 
111, 18478-18483; 2014). The data were fairly 
reliable over the island’s interior, Csatho says, 
but measurements were more difficult along its 


edges, where the ice tends to be warmer, thicker 
and full of crevices. “It’s still a challenge to get 
the mass of these glaciers,” she says. 

When the aerial phase of OMG begins next 
year, planes will fly inland from the coast, taking 
measurements of slight changes in gravitational 
pull that can be used to produce low-resolution 
maps of the topography under both water and 
ice. Planes will also drop more than 200 temper- 
ature and salinity probes into fjords and coastal 
waters, and take radar measurements along 
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the coast to track large-scale ice loss over five 
years. Analysing that ice loss in light of the new 
topographical and oceanographic data will help 
researchers to determine where, and to what 
extent, deeper saltwater currents affect glaciers. 
Lipscomb says that all these OMG data 
should help modellers as they incorporate 
ocean-ice interactions around Greenland 
into their models. That work is still in its early 
stages, he says, “but the data that they are get- 
ting in this project is exactly what we need”. m 


SOURCE: NASA 


PLANETARY SCIENCE 


Kepler spies most 
Earth-like planet yet 


NASA mission finds a potentially rocky world orbiting a star that resembles the Sun. 


BY ALEXANDRA WITZE 


ven as astronomers are reporting what 
Bie like the closest thing yet to an 

Earth-like exoplanet, NASA is winding 
down the prolific Kepler mission that made 
the find. Sometime next year, team scien- 
tists plan to release their final list of planet 
discoveries. 

On 23 July, the Kepler team announced the 
existence of a planet 1.6 times the size of Earth, 
orbiting a Sun-like star 430 parsecs away 
(J. M. Jenkins et al. Astron. J. 150, 56,2015). 
The planet, named Kepler-452b, is in the hab- 
itable zone, orbiting its star at a distance where 
liquid water could exist. Team scientists say 
that there is a little more than a 50% chance 


HABITABLE HUNT 


that the planet is rocky, which would make it 
the closest thing to a true Earth analogue yet 
discovered. 

Kepler’s latest batch of discoveries also 
includes at least 11 other planets that are all 
less than twice the size of Earth and orbit in 
their stars’ habitable zones. But Kepler-452bs 
star is slightly brighter than the Sun, in contrast 
to the cool, dim stars that host other known 
Earth-sized planets. 

“Tt is the first terrestrial planet in the habit- 
able zone around a star very similar to the Sun,” 
says Douglas Caldwell, an astronomer at the 
SETI Institute in Mountain View, California. 

Scientists cannot measure the mass of 
Kepler-452b directly, but modelling suggests 
that the planet is probably five times the mass 


NASA's Kepler mission has found several small planets outside the Solar System that could harbour 
life. They orbit their stars in the ‘habitable zone’ where liquid water could exist. 
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of Earth. It whirls around its star once every 
385 days, tantalizingly close to Earth's 365-day 
year (see “Habitable hunt’). 

The planet’s star is about 1.5 billion years 
older than the 4.5-billion-year-old Sun, and 
Kepler-452b is about the same age. During 
its first 5 billion years, the planet would have 
received less energy from its star than Earth 
does from the Sun, but it may now offer a 
glimpse of Earth’s future. Kepler-452b’s star is 
growing hotter and brighter as part of its natu- 
ral evolution, so anyone living on the planet 
would see their world drying out — just as 
Earth will as the Sun evolves. 

From 2009 to 2013, the Kepler craft stared at 
a small patch of sky looking for slight decreases 
in starlight that signalled a planet moving, or 
transiting, across the face of its star. It stopped 
taking data when it was crippled by a broken 
reaction wheel that guided its pointing, but 
engineers later revived it in a limited fashion. 

The craft has discovered more than 
1,000 confirmed planets, including Kepler- 
452b, and more than 4,660 candidates. 

In the next year, Kepler scientists will per- 
form analyses to reduce the signal-to-noise 
ratio in their data, teasing out as many planets 
as possible. “Imagine youre trotting through 
the weeds in a field, looking for precious 
stones on the ground,” says Natalie Batalha, 
Kepler’s mission scientist and an astronomer 
at NASA’s Ames Research Center in Moffett 
Field, California. “We're going through with a 
lawnmower so that the stones are easier to see.” 

In January, the European Southern Observa- 
tory began a search for transiting planets from 
its telescopes in Chile. NASA plans to launch 
a space-based successor to Kepler called the 
Transiting Exoplanet Survey Satellite in 2017. m 
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ANTHROPOLOGY 


Neanderthals had outsize 
effect on human biology 


From skin disorders to the immune system, sex with archaic species changed Homo sapiens. 


BY EWEN CALLAWAY 


ur ancestors were not a picky bunch. 
()erervisinin genetic evidence 

shows that Homo sapiens had sex with 
Neanderthals, Denisovans and other archaic 
relatives. Now researchers are using large 
genomics studies to unravel the decidedly mixed 
contributions that these ancient romps made to 
human biology — from the ability of H. sapiens 
to cope with environments outside Africa, to the 
tendency of modern humans to get asthma, 
skin diseases and maybe even depression. 

The proportion of the human genome that 
comes from archaic relatives is small. The 
genomes of most Europeans and Asians are 
2-4% Neanderthal’, with Denisovan DNA 
making up about 5% of the genomes of Mela- 
nesians’ and Aboriginal Australians’, DNA 
slivers from other distant relatives probably 
pepper a variety of human genomes’. 

But these sequences may have had an out- 
size effect on human biology. In some cases, 
they are very different from the correspond- 
ing H. sapiens DNA, notes population geneti- 
cist David Reich of Harvard Medical School 
in Boston, Massachusetts — which makes it 
more likely that they could introduce use- 
ful traits. “Even though it’s only a couple or 
a few per cent of ancestry, that ancestry was 
sufficiently distant that it punched above its 
weight,” he says. 

Last year, Reich co-led one of two teams that 
catalogued the Neanderthal DNA living on in 
modern-day humans”®. The studies hinted that 
Neanderthal versions of some genes may have 
helped Eurasians to reduce heat loss or grow 
thicker hair. But the evidence that these genes 
were beneficial was fairly weak. 

To get a better handle on how Neanderthal 
DNA shapes human biology, Corinne Simonti 


A gene variant from archaic humans helps modern-day Tibetans to cope with high altitudes. 


and Tony Capra, evolutionary geneticists at 
Vanderbilt University in Nashville, Tennessee, 
turned to genome-wide association studies 
(GWAS) that had already compared thousands 
of DNA variants in people with and without a 
certain disease or condition. 

Using de-identified genome data and medical 
records from 28,000 hospital patients, Simonti 
and Capra looked for differences in traits and 
medical diagnoses between people with a par- 
ticular Neanderthal gene variant and those with 
the H. sapiens version of the same gene. They 
found that the Neanderthal variants seemed 
to slightly increase the risk of conditions such 
as osteoporosis, blood-coagulation disorders 
and nicotine addiction. Another analysis, 
which looked at the combined effects of many 
DNA variants, painted a more mixed picture. 
It revealed links between Neanderthal DNA 


and depression, obesity and certain skin disor- 
ders, with some variants being associated with 
an increased risk and others with a reduced 
risk. Simonti presented the data at the annual 
meeting of the Society for Molecular Biology 
and Evolution in Vienna on 15 July. 
Neanderthal gene variants, like most human 
variants, had only a tiny effect on the risk of 
developing these conditions, notes Capra. But 
seeing Neanderthal genes involved in skin 
disorders, including lesions triggered by sun 
exposure, chimes with previous studies link- 
ing Neanderthal DNA to skin biology, he says. 
In some cases, the effects of the archaic genes 
may have changed over the ages. Simonti and 
Capra also reported at the Vienna meeting that 
blood-coagulation disorders experienced by 
modern humans could be related to Neander- 
thal immune genes, although previous studies 
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have suggested that archaic immune genes may 
have helped H. sapiens to cope with diseases 
that they encountered outside Africa. 

Also at the meeting, a team led by Michael 
Dannemann, a computational biologist at 
the Max Planck Institute for Evolutionary 
Anthropology in Leipzig, Germany, reported 
that many humans have Neanderthal and 
Denisovan versions of genes that encode pro- 
teins called toll-like receptors (TLRs), which 
sense pathogens and launch a rapid immune 
response. Furthermore, cultured human cells 
containing the archaic versions tended to 
express the TLRs at higher levels than cells with 
the H. sapiens versions’. Although previous 
GWAS linked the archaic versions to a reduced 


risk of Heliobacter pylori infection, which can 
cause stomach ulcers, the variants were also 
associated with higher rates of allergies. 

“Many traits that were adaptive 10,000 years 
ago might be maladaptive today” because of 
lifestyle, diet and other shifts, notes Rasmus 
Nielsen, a population geneticist at the Univer- 
sity of California, Berkeley. 

At least one archaic trait has clear benefits 
in contemporary humans. Last year, Nielsen's 
team reported that the Denisovan-like version 
of a gene called EPAS1 helps modern Tibetans 
to cope with life at altitudes of 4,000 metres, by 
preventing their blood from thickening®. 

Many researchers see Nielsen’s EPAS1 
discovery as the poster child for humans’ 
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archaic biology, because the benefits of the 
Denisovan version are so clear-cut. But proving 
such insights rests on laborious studies, includ- 
ing engineering mice to carry the archaic 
mutations and exhaustively testing the animals’ 
biology, notes Reich. “Each new finding is going 
to be very hard won?’ = 
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Budget showdown leaves 
US science agencies in limbo 


Lawmakers face looming deadline to reach a deal — or risk government shutdown. 


BY CHRIS CESARE 


hen the US Congress returns from 
its late-summer recess in early 
September, lawmakers and Presi- 


dent Barack Obama will have less than three 
weeks to reach a budget deal for 2016, and in 
doing so determine the funding of key science 
agencies. 

The most likely scenario, experts say, is that 
a temporary deal will be made to keep the gov- 
ernment operating for weeks or months after 
the 2016 fiscal year begins on 1 October. That 
is cold comfort for US science agencies and 
researchers who have endured years of bruis- 
ing partisan spending battles. 

“We're basically headed into a period of frus- 
tration where nothing’s going to happen for a 
couple months, and we're just going to have 
to deal with it” says Jennifer Zeitzer, direc- 
tor of legislative relations at the Federation of 
American Societies for Experimental Biology 
in Bethesda, Maryland. 

The current funding agreement expires on 
30 September. And, in protest against legisla- 
tors’ embrace of the across-the-board budget 
cuts known as sequestration, Obama has 
threatened to veto many of the 2016 spending 
bills introduced by the Republican-controlled 
House of Representatives and Senate. 

Some agencies, such as the National 
Institutes of Health (NIH), seem likely to 
emerge as winners in any deal (see ‘Budget 
battle’). The House has proposed increas- 
ing the NIH budget by US$1.1 billion in 


How funding proposals from the White House and Congress stack up, by agency (US$ millions). 


BUDGET BATTLE 

Agency 2015 estimated 
budget 

Biomedical research and public health 

National Institutes of Health 30311 

Centers for Disease Control 6,073 

and Prevention 

Food and Drug 2,596 

Administration 

Physical sciences 

National Science Foundation 7,344 

NASA (science) 5,245 

Department of Energy Office 5,068 

of Science 

National Institute of 864 

Standards and Technology 

Earth and environment 

Environmental Protection 8,140 

Agency 

National Oceanic and 5,449 

Atmospheric Administration 

US Geological Survey 1,045 


2016, to $31.2 billion; the Senate’s proposal 
of $32.1 billion is even more generous. And 
both are close to the White House proposal 
of $31.3 billion. 

But for other agencies and programmes, 
the prospects are not so clear. The House 
matched Obama's $18.5-billion request for 
NASA, and the Senate is close at $18.3 billion. 
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2016 White 2016 House 2016 Senate 
House request bill bill 
chen 31,184 32,084 
6,170 7,010 6,711 
2,744 2,619 2,629 
7,724 7,394 7,344 
5,289 5,238 295) 
5,340 5,100 5,100 
1,120 855 893 
8,592 7 A22 7,597 
5,983 5,167 5,382 
iL Ales) 1,045 MO59) 


But the House wants to chop $101 million 
from the space agency's $1.8-billion Earth- 
sciences account, which funds research on 
topics such as climate change. The Senate 
would boost the wing’s funding by roughly 
$148 million. Similarly, the House bill would 
set aside $5.2 billion for the National Oce- 
anic and Atmospheric Administration > 
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> (NOAA), 5% less than the current level 
and about $800 million short of Obama's 
request. The Senate bill would reduce 
NOAA spending by just 1%. 

But it is the National Science Founda- 
tion (NSF) that has most polarized law- 
makers. The House’s NSF spending bill 
would require the agency to award 70% 
of its $6-billion research fund to biology, 
computer science, engineering, mathemat- 
ics and the physical sciences. The unusual 
provision would effectively impose a 16% 
cut to geoscience and social-sciences 
programmes, according to an analysis by 
the American Institute of Physics. By con- 
trast, the Senate's bill does not set funding 
levels for particular disciplines. 


BASIC FOCUS 

Powerful House Republicans, most notably 
science-committee chair Lamar Smith of 
Texas, have argued that the NSF should 
concentrate on basic research. Smith has 
also tried to highlight what he sees as ques- 
tionable grants by the science agency, such 
as funding for a study of mental health in 
Nepal. But Gloria Waters, vice-president 
and associate provost for research at Bos- 
ton University in Massachusetts, says that 
legislators often misunderstand the role of 
basic science. “People have this idea that 
science funding should go to something 
that should have an immediate and direct 
impact on society, but that’s not how sci- 
ence works,” she says. 

Deciding which projects to fund is 
made more difficult by a lack of money, 
says Hannah Carey, a physiologist at the 
University of Wisconsin-Madison. “I’ve 
experienced it — you put in a grant to 
continue your work that gets a very, very 
good score and would have been funded 
ina better climate,’ says Carey, who spent 
a year working as a programme director 
in the NSF’s biosciences division. “It’s 
disheartening” 

A short-term spending deal would avert 
a government shutdown of the sort that 
ground most research to a halt in October 
2013. But a stopgap arrangement could 
still make life difficult for researchers. 
Such deals generally prevent agencies 
from starting new programmes or end- 
ing old ones without specific authoriza- 
tion from Congress. And agencies can 
face unexpected budget shortfalls if law- 
makers agree to cut spending after months 
of operating under a temporary funding 
agreement, because the cuts would be ret- 
roactive. 

For now, scientists are left to wait while 
the negotiations between Congress and the 
White House play out. Zeitzer says that a 
deal may not be struck until the last min- 
ute. “I'm hearing there’s a real good chance 
they'll take us to the brink,” she says. = 
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The CHIME telescope array will search for a particular kind of hydrogen emission from ancient galaxies. 


COSMOLOGY 


Half—pipe array to 
map teen Universe 


Canadian telescope aims to chart cosmic expansion rate 
between 10 billion and 8 billion years ago. 


BY DAVIDE CASTELVECCHI 


observatory shaped like the half-pipes 
used by snowboarders, and dependent on 
technology originally designed for gaming and 
mobile phones, will soon be tasked with plug- 
ging a crucial gap in the cosmological record: 
what the Universe did when it was a teenager. 

The information will allow cosmologists to 
gauge whether the strength of dark energy — 
the force accelerating the Universe's expansion 
— has changed over time, an unresolved ques- 
tion that governs the fate of the cosmos. 

Whereas typical radio telescopes have round 
dishes, the Canadian Hydrogen Intensity Map- 
ping Experiment (CHIME) comprises four 
100-metre-long, semi-cylindrical antennas, 
which lie near the town of Penticton in British 
Columbia. 

From 2016, CHIME’s half-pipes, which 
are scheduled to be completed this week, will 
detect radio waves emitted by hydrogen in 
distant galaxies. These observations would 
be the first measurements of the Universe’s 


I: sounds almost too apt to be true. An 
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expansion rate between 10 billion and 8 billion 
years ago, a period in which the cosmos went 
“from being a kid to an adult”, says Mark 
Halpern, the leader of CHIME and an experi- 
mental cosmologist at the University of Brit- 
ish Columbia in Vancouver. Straight after the 
Big Bang 13.8 billion years ago, the rate of the 
Universe's expansion slowed. But somewhere 
during the ‘adolescent’ period, dark energy — 
which eventually turned the Universe's slowing 
expansion into the acceleration observed today 
— began to be felt, he says. 

Itis a window in time that has, until now, been 
closed. Cosmologists measure the Universe's 
past expansion rate using ancient objects, such 
as supernova explosions and the voids between 
galaxies, that are so distant that their light is only 
now reaching Earth. Over the past few decades, 
such objects have revealed that the cosmos has 
been expanding at an accelerating rate for more 
than 6 billion years. And surveys of quasars — 
mysterious, super-bright objects that outshine 
the entire galaxies they lie in — have shown that 
until 10 billion years or so ago, the Universe's 
expansion was slowing down. 


MARK HALPERN/CHIME COLLABORATION 


But cosmologists have struggled to measure 
the expansion rate in the interim, leaving open 
the question of whether the strength of dark 
energy’s repulsive force may have varied over 
time. 

CHIME is designed to fill the gap, says 
Kendrick Smith, an astrophysicist at the Perime- 
ter Institute for Theoretical Physics in Waterloo, 
Canada, who will work on analysing CHIME’s 
data. The half-pipe antennas will allow CHIME 
to receive radio waves coming from anywhere 
along a narrow, straight region of the sky at any 
given time. “As the Earth rotates, this straight 
shape sweeps out the sky,” says Smith. 

To sort out where individual signals are 
coming from, a custom-built supercomputer 
made of 1,000 relatively cheap graphics- 
processing units — the type used for high- 
end computer gaming — will crunch through 
nearly 1 terabyte of data per second. The 
team will also use signal amplifiers originally 
developed for mobile phones. Without such 
powerful consumer-electronics components, 
CHIME would have been prohibitively expen- 
sive, says experimental cosmologist Keith 
Vanderlinde of the University of Toronto, 
Canada, who is co-leading the project. 

CHIME'’s supercomputer will look specifi- 
cally for radio waves with a wavelength that 
suggests an age of 11 billion to 7 billion years, 
emitted by the hydrogen in the interstellar space 


inside galaxies (at their source, such emissions 
have a wavelength of 21 centimetres). Research- 
ers will then subtract the ‘radio noise’ in the 
same wavelength range that comes from the 
Milky Way and Earth. 

Although CHIME will not be able to distin- 
guish individual galaxies in this way, clumps 
of hundreds or thousands of galaxies will show 

up, says Vanderlinde. 


“As the Earth This will allow 
rotates, this researchers to map 
straight shape the expansion rate 
sweeps out of the voids between 
the sky.” the clumps, and in 


turn to calculate the 
strength of dark energy during that time. 

If the results imply that the strength of dark 
energy then was the same as it has been in the 
past 6 billion years, it could suggest that galax- 
ies will eventually lose sight of each other. But 
if the strength of dark energy has changed over 
the eons, all bets are off: the Universe could 
collapse in a ‘big crunch; for example, or be 
ripped apart into its subatomic components. 

As well as mapping the adolescent Uni- 
verse, CHIME could also detect hundreds of 
the mysterious ‘fast radio bursts’ that last just 
milliseconds and have no known astrophysi- 
cal explanation. And it will help other experi- 
ments to calibrate measurements of radio 
waves from rapidly spinning neutron stars, 
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which researchers hope to use to detect the 
ripples in space-time known as gravitational 
waves (see Nature 463, 147; 2010). 

CHIME is part of a growing trend in astron- 
omy. A number of experiments that are now 
active or in the planning stages, including the 
hotly anticipated Square Kilometer Array — to 
be built on sites in Australia and South Africa 
— are designed to look at hydrogen emissions 
with 21-centimetre wavelengths. These emis- 
sions are an untapped trove of cosmological 
information, says Tzu-Ching Chang, an astro- 
physicist at the Academia Sinica Institute of 
Astronomy and Astrophysics in Taipei who 
helped to pioneer the hydrogen mapping of gal- 
axies in a 2010 study (T:-Z. Chang et al. Nature 
466, 463-465; 2010). She likens the boom in 
hydrogen mapping today to the trend in the 
1990s of studying the relic radiation of the Big 
Bang, which revolutionized cosmology. m 


CLARIFICATION 

The News story ‘US tailored-medicine 
project aims for ethnic balance’ (Nature 523, 
391-392; 2015) implied that the plan for 
the whole Precision Medicine Initiative is 
due to be announced in the next few weeks. 
Actually, the forthcoming plan is just for the 
cohort-study component of the project. 
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The trouble with 


An easy method that promised to save lives in hospitals 


worldwide may not be so simple after all. 


BY EMILY ANTHES 


efore making the first incision, confirm the patient’s identity. 

Mark the surgical site. Ask about allergies. Discuss any antici- 

pated blood loss. Introduce yourself by name. These are some of 

the 19 tasks on the World Health Organization (WHO) Surgical 

Safety Checklist, a simple list of actions to be completed before 
an operation in order to cut errors and save lives. 

In 2007 and 2008, surgical staff at eight hospitals around the world 
tested the checklist in a pilot study’. The results were remarkable. Com- 
plications such as infections after surgery fell by more than one-third, 
and death rates dropped by almost half. The WHO recommended that 
all hospitals adopt its checklist or something similar, and many did. 
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Many hospitals have 
introduced pre-surgery 
checklists, with mixed 
results. 


T. C. WORLEY/NYT/REDUX/EYEVINE 


The UK National Health Service (NHS) immediately required all of 
its treatment centres to put the checklist into daily practice; by 2012, 
nearly 2,000 institutions worldwide had tried it. The idea of checklists 
as a simple and cheap way to save lives has taken hold throughout the 
clinical community. It has some dynamic champions, including Atul 
Gawande, a surgeon at Brigham and Women’s Hospital in Boston, Mas- 
sachusetts, who led the pilot study and has spread the word through 
talks, magazine articles and a best-selling book, The Checklist Manifesto 
(Metropolitan, 2009). 

But this success story is beginning to look more complicated: some 
hospitals have been unable to replicate the impressive results of initial 
trials. An analysis of more than 200,000 procedures at 101 hospitals in 
Ontario, Canada, for example, found no significant reductions in com- 
plications or deaths after surgical-safety checklists were introduced”. 
“We see this all the time,’ says David Urbach, a surgeon at the University 
of Toronto who led the Ontario analysis. “A lot of studies that should be a 
slam dunk don’t seem to work in practice.” The stakes are high, because 
poor use of checklists means that people may be dying unnecessarily. 

A cadre of researchers is working to make sense of the discrepan- 
cies. They are finding a variety of factors that can influence a checklist’s 
success or failure, ranging from the attitudes of staff to the ways that 
administrators introduce the tool. The research is part of the growing 
field of implementation science, which examines why some innova- 
tions that work wonderfully in experimental trials tend to fall flat in the 
real world. The results could help to improve the introduction of other 
evidence-based programmes, in medicine and beyond. 

“We need to learn the lessons from programmes and interventions 
like the checklist so we don't make the same mistakes again,” says Nick 
Sevdalis, an implementation scientist at King’s College London. 


REPLICATION FRUSTRATION 


One of the first to demonstrate the potential of checklists in health care 
was Peter Pronovost, an anaesthesiologist and critical-care physician 
at Johns Hopkins University School of Medicine in Baltimore, Mary- 
land. In 2001, Pronovost introduced a short checklist for health-care 
workers who insert central venous catheters, or central lines, which are 
often used in an intensive care unit (ICU) to test blood or administer 
drugs. The trial showed that asking practitioners to confirm that they 
had performed certain simple actions, such as washing their hands and 
sterilizing the insertion site, contributed to a dramatic reduction in the 
risk of life-threatening infections’. The list got a larger test in a now- 
famous trial* known as the Keystone ICU project, launched in Michigan 
in October 2003. Within 18 months, the rate of catheter-related blood- 
stream infections fell by 66%. 

Checklists were not completely new to medicine, but Pronovost’s 
work attracted attention because it suggested that they could save lives. 
Gawande penned an inspiring feature in The New Yorker’, asking: “If 
something so simple can transform intensive care, what else can it do?” 
Checklists began to proliferate. Now there are checklists for procedures 
involving anaesthesia, mechanical ventilation, childbirth and swine flu. 
Many studies have generated promising results, showing that the lists 
improve patient outcomes in hospitals from Norway to Iran. 

But there have also been some failures. This January, less than a year 
after the report from Ontario, a different team of scientists reported® 
that a surgical checklist modelled on Pronovost’s list did not improve 
outcomes at Michigan hospitals. And although the central-line checklist 
for ICUs has provided lasting benefits in Michigan, a British initiative 
called Matching Michigan, which aimed to replicate the Keystone pro- 
gramme, seemed to make no difference to infection rates’. 

Some experts suspect that the failure to replicate could be a matter 
of how the initial trials or the follow-up studies were designed. 
Gawande’s pilot study of the WHO surgical checklist, for example, 
was not randomized and had no control group. Instead, it compared 
complication and death rates before and after the checklist was intro- 
duced. Critics say that this makes it difficult to determine what other 
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factors might have influenced outcomes. 

Gawande acknowledges the limitation, which was due to cost restric- 
tions, but he points out that many subsequent trials, including ones that 
were randomized, have also demonstrated large reductions in complica- 
tions and mortality following the introduction of the checklist. The list 
works, he says — as long as it is implemented well. “It turns out to be 
much more complex that just having the checklist in hand” 


TICKING BOXES 


Implementation scientists are trying to make sense of that complexity. 
After the NHS mandated the WHO checklist, researchers at Imperial 
College London launched a project to monitor the tool’s use, and found 
that staff were often not using it as they should. In a review of nearly 
7,000 surgical procedures performed at 5 NHS hospitals, they found 
that the checklist was used in 97% of cases, but was completed only 
62% of the time®. When the researchers watched a smaller number of 
procedures in person, they found that practitioners often failed to give 
the checks their full attention, and read only two-thirds of the items out 
loud”. In slightly more than 40% of cases, at least one team member was 
absent during the checks; 10% of the time, the lead surgeon was missing. 

Going through all the steps in the list really mattered, the research 
showed. The more of the checklist that teams completed, the lower the 
complication rates. Several other studies have also revealed that higher 
compliance with the checklist is associated with better outcomes. 

“Tf it’s used well, if it’s used in the original spirit and intention with 
which it was designed, I think it has real potential,” says Sevdalis, who 
was part of the Imperial College research team. “If it’s used for people to 
tick the box and say, ‘Oh yes, we've done it; but without really thinking 
about the patient, without really informing their team members about 
aspects of the procedure that are relevant to them, I don’t think the 
checklist will make any difference.” 

To find out why checklists were not being used properly, Sevdalis and 


“WHEN IT WAS INTRODUCED 
WITHOUT ANY PROGRAMME 
OR SUPPORT, IT WAS JUST 
IMPOSSIBLE, | THINK, FOR 
TEAMS 10 BUY INTO IT.” 


his colleagues interviewed more than 100 members of operating-theatre 
staff at 10 NHS hospitals’®. Half of the respondents reported that sen- 
ior surgeons and anaesthesiologists sometimes actively resisted the 
checklists, making it difficult for the rest of the team to complete the 
tasks. Staff also complained about the checklist itself: that it was poorly 
worded, time-consuming, inappropriate for certain procedures or 
redundant with other safety checks. Some also questioned whether there 
were enough data to support the checklist’s use (see “Why checklists fail’). 

About one-quarter of the respondents objected to how the check- 
list had been introduced. Although some hospitals provided training 
and solicited feedback from staff, at other institutions there was little 
involvement from those actually working in the operating theatre. That 
strategy might make it difficult for staff to feel invested in the check- 
list, and ultimately undermine its correct use. “When it was introduced 
without any programme or support, it was just impossible, I think, for 
teams to buy into it,’ says psychologist Stephanie Russ, who was part of 
the research team and is now at the University of Aberdeen, UK. 
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Mary Dixon-Woods, a medical 
sociologist at the University of Leicester, 
UK, interviewed staff members at 17 
of the ICUs participating in Matching 
Michigan’. She found that by the time 
the programme began, British hospitals 
had already been involved in numer- 
ous government-led efforts to reduce 
infections. The checklist, she says, was 
viewed as “yet another example of these 
top-down, intrusive, imposed initia- 
tives”. It became “something that had 
to be endured rather than enjoyed”. In 
Michigan, by contrast, the tool was con- 
sidered new and exciting. And it was not 
imposed by the government — it was 
organized by the well-regarded state 
hospital association, and participation 
was voluntary. 

Dixon-Woods did identify one 
exemplary ICU, in which a high infec- 
tion rate fell to zero after Matching 
Michigan began. The unit was led by 
a charismatic physician who champi- 
oned the checklist and rallied others 
around it. “He formed coalitions with 
his colleagues so everyone was singing 
the same tune, and they just committed 
as a whole unit to getting this problem 
under control,’ says Dixon-Woods. 

Other work has also found that it 
might be helpful to enlist local champi- 
ons who can promote an intervention 
within a hospital, and some have hinted 
at how to get colleagues on board. Ina 
2011 study” of five hospitals in Wash- 
ington state, Gawande and his colleagues 
found that it is crucial that leaders take 
the time to explain how to use the check- 
list and why it should be used. “That might 
have included pulling on somebody’s heart 
strings, it might have included sharing as much evidence as possible, 
it might have included talking through the theoretical story or giving 
some important example,’ says Sara Singer, a health-policy researcher 
at the Harvard T. H. Chan School of Public Health in Boston, Massa- 
chusetts, who co-authored the study. 


A LOCAL LIST 


Experts also recommend that hospitals modify standard checklists to 
help the tool fit into the local workflow and to produce a feeling of 
investment and ownership. Pronovost encouraged the ICUs that par- 
ticipated in the Keystone project to make his checklist their own. “They 
were 95% the same, but that 5% made it work for them,” he says. “Every 
one of these hospitals thought that theirs was the best.” 

Pronovost and Dixon-Woods also think that several other factors 
contributed to the success in Michigan ICUs. Providing the hospitals 
with regular feedback on their infection rates created social pressure 
for improvement, they say, and regular in-person workshops allowed 
staff from different hospitals to share their experiences and created the 
sense of a shared mission. 

Beyond that, logistics are crucial. When Pronovost was first develop- 
ing his checklist at Johns Hopkins, he noticed that ICU doctors had to 
go to eight different places to collect all the supplies they needed to per- 
form a sterile central-line insertion. As part of the Keystone programme, 
hospitals assembled carts that contained all the necessary supplies. 
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WHY CHECKLISTS FAIL 


Operating - theatre staff at ten UK hospitals 
were interviewed about the barriers to 
implementing the World Health Organization 
surgical checklist. The biggest problems were: 


Staff resisted or failed to complete the checklist. 


“When the surgeons weren’t on 
board you were told to ‘Oh shut : 
up and let’s get on with it.’” says Dixon-Woods. 


The checklist was inappropriate or illogical. 


The checklist was thought to waste time. 


“Yet more delay! Oh gosh, 
we’re going to get less work 
done for the patients.” 


Ina 2013 study”, Dixon-Woods found 
that an African hospital using the WHO 
surgical checklist had regular shortages 
of the basic tools — such as surgical 
markers, antibiotics and pulse oxime- 
ters — that are required to complete the 
list. But the staff often ticked those boxes 
anyway; as one anaesthetist pointed out, 
it was often better for a patient to undergo 
surgery without these supplies than not 
to have surgery at all. If the checklist is 
going to succeed in low-income settings, 
these problems have to be addressed. 
“There's no point in having an item that 
says, ‘Have the antibiotics been given?’ if 
there are no antibiotics in the hospital” 


a7 


The clear lesson for hospital leaders 
is that they cannot just dump a stack of 
checklists in an operating room — they 
must observe them being used. Are team 
members all present? Are they rushing, 
or skipping steps? If so, then the lapses 
should be discussed and addressed. 

Implementation researchers say that 
the checklist story may hold lessons for 
the introduction of other programmes in 
fields including medicine, education and 
social work. “We have this massive influx 
of money to develop innovations,’ says 
Dean Fixsen, who co-founded the US 
National Implementation Research Net- 
work at the University of North Carolina 
at Chapel Hill. “But the track record of 
getting that science into practice where it 
actually produces the kinds of outcomes 
that we want to see — that track record 
is abysmal. Over the past few decades, 
researchers have published countless 

papers on evidence-based literacy pro- 
grammes and teaching strategies. And yet 
literacy rates for US nine-year-olds, for instance, have barely budged. 

Fortunately, Fixsen says, the lessons of implementation science are 
“completely generalizable’, and all programmes could benefit by noting 
the importance of engaged leadership, local adaptation and user buy- 
in. “It doesn’t matter how good the innovation is, it doesn’t matter how 
much has been invested,” says Fixsen. “If we don’t have the implementa- 
tion savvy, we're going to get the crummy outcomes that we have seen 
decade after decade?” m 


34% 


“It's a bit bizarre and there's 
asense of, I'm not actually 
progressing the patient care 
with this question.” 


Emily Anthes is a freelance journalist in New York City. 
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RISE OF THE ORGANOIDS 


Biologists are 
building banks of 
mini-organs, and 
learning a lot about 
human development 
on the way. 


BY CASSANDRA 
WILLYARD 
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t was an otherwise normal day in November when Madeline Lancaster 
realized that she had accidentally grown a brain. For weeks, she had 
been trying to get human embryonic stem cells to form neural rosettes, 
clusters of cells that can become many different types of neuron. But for 
some reason her cells refused to stick to the bottom of the culture plate. 
Instead they floated, forming strange, milky-looking spheres. 

“T didn’t really know what they were,’ says Lancaster, who was then 
a postdoc at the Institute of Molecular Biotechnology in Vienna. That 


day in 2011, however, she spotted an odd dot of pigment in one of her spheres. Looking under 
the microscope, she realized that it was the dark cells of a developing retina, an outgrowth of 
the developing brain. And when she sliced one of the balls open, she could pick out a variety of 
neurons. Lancaster realized that the cells had assembled themselves into something unmistak- 
ably like an embryonic brain, and she went straight to her adviser, stem-cell biologist Jiirgen 
Knoblich, with the news. “I’ve got something amazing,’ she told him. “You've got to see it.” 
Lancaster and her colleagues were not the first to grow a brain in a dish. In 2008, researchers 
in Japan reported' that they had prompted embryonic stem cells from mice and humans to form 
layered balls reminiscent ofa cerebral cortex. Since then, efforts to grow stem cells into rudimen- 
tary organs have taken off. Using carefully timed chemical cues, researchers around the world 


have produced three-dimensional structures that resemble tissue from the 
eye, gut, liver, kidney, pancreas, prostate, lung, stomach and breast. These 
bits of tissue, called organoids because they mimic some of the structure 
and function of real organs, are furthering knowledge of human develop- 
ment, serving as disease models and drug-screening platforms, and might 
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eventually be used to rescue damaged organs (see “The organoid bank). 
“It's probably the most significant development in the stem-cell field in 
the last five or six years,” says Austin Smith, director of the Wellcome 
Trust/MRC Stem Cell Institute at the University of Cambridge, UK. 

The current crop of organoids isn't perfect. Some lack key cell types; 
others imitate only the earliest stages of organ development or vary 
from batch to batch. So researchers are toiling to refine their organoids 
— to make them more complex, more mature and more reproducible. 
Still, biologists have been amazed at how little encouragement cells 
need to self-assemble into elaborate structures. “It doesn’t require any 
super-sophisticated bioengineering,” says Knoblich. “We just let the 
cells do what they want to do, and they make a brain” 


GROWING A GUT 


This shouldn't come as a major surprise, says 
molecular biologist Melissa Little at the Uni- 
versity of Queensland, Australia. “The embryo 
itself is incredibly able to self-organize; it doesn't 
need a template or a map.” That has been known 
since the early 1900s, when embryologists 
showed that sponges that had been broken up 
into single cells could reassemble themselves. But such work fell out of 
fashion, and modern biologists have focused their attention on purify- 
ing cells and growing them in culture — often in flat layers that do little 
to mimic normal human tissue. 

Studying these cells to understand how an organ functions is like 
studying a pile of bricks to understand the function of a house, says 
Mina Bissell, a cancer researcher at the Lawrence Berkeley National 
Laboratory in California. “We should just begin to make the house,” 
she says. Bissell’s work on cultures of breast cells helped to propagate 
the idea that cells behave differently in 3D cultures than in conven- 
tional flat ones. By the mid-2000s, the idea was catching on. The burst 
of enthusiasm was fuelled by Yoshiki Sasai, a stem-cell biologist at the 
RIKEN Center for Developmental Biology in Kobe, Japan, who turned 
heads when he grew a cerebral cortex’, followed bya rudimentary optic 
cup’ and pituitary gland’ (see Nature 488, 444-446; 2012). 

Just a year after Sasai announced his layered cortex, Hans Clevers, a 
stem-cell researcher at the Hubrecht Institute in Utrecht, the Nether- 
lands, reported the creation of a mini-gut*. The breakthrough stemmed 
from a discovery in 2007, when Clevers and his colleagues had identi- 
fied intestinal stem cells in mice. In the body, these cells seemed to 
have an unlimited capacity to divide and replenish the intestinal lining, 
and one of Clevers’ postdocs, Toshiro Sato, was tasked with culturing 
them in the lab. 

Rather than growing the cells flat, the pair decided to embed them in 
matrigel, a soft jelly that resembles the extracellular matrix, the mesh 
of molecules that surrounds cells. “We were just trying things,” Clevers 
says. “We hoped that we would make maybe a sphere or a blob of cells.” 
Several months later, when Clevers put his eye to Sato’s microscope, he 
saw more than blobs. The cells had divided, differentiated into multiple 
types, and formed hollow spheres that were dotted with knobby protru- 
sions. Inside, the team found structures that resembled the intestine’s 
nutrient-absorbing villi as well as the deep valleys between them called 
crypts. “The structures, to our total astonishment, looked like real guts,” 
Clevers says. “They were beautiful” 

The mini-guts, reported in 2009, may prove to be a powerful tool in 
personalized medicine. Clevers and his team are using them to study the 
effectiveness of drugs in people with cystic fibrosis, who have genetic 
defects that affect ion channels and disrupt the movement of water in 
and out of the cells lining the lungs and intestine. The researchers take 
rectal biopsies from people with the disease, use the cells to create per- 
sonalized gut organoids and then apply a potential drug. If the treat- 
ment opens the ion channels, then water can flow inwards and the gut 
organoids swell up. “It’s a black-and-white assay,’ Clevers says, one that 
could prove quicker and cheaper than trying drugs in people to see 
whether they work. 
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THE ORGANOID BANK 


Since the late 2000s, biologists have grown a wide variety of rudimentary 
organs to understand development and for medical uses. 


Organoid Potential application 


Cerebral cortex Understand brain development, as well as 


neurodegenerative diseases and other disorders 


Intestine Personalized organoids for identifying patient-tailored 
drugs 

Optic cup Source of retinal tissue for eye therapies 

Pituitary gland Source of therapeutic cells for endocrine disorders 

Kidney Toxicity testing and a source of tissue for transplantation 

Liver Repair of damaged liver 

Pancreas Treat diabetes and identify drugs for pancreatic cancer 

Neural tube Study nerve development and a source of cell therapies 

Stomach Understand stomach development and model gastric 
disorders such as ulcers 

Prostate Predict effective drug combinations for prostate cancer 

Breast Understand tumour development 

Heart Study cardiac development and how drugs affect it 

Lung Model for lung development, maturation and disease 


He has already used the system to assess whether a drug called 
Kalydeco (ivacaftor), and 5 other cystic-fibrosis drugs, will work in 
about 100 patients; at least 2 of them are now taking Kalydeco asa result. 

Organoids may also help physicians to choose the best therapies for 
people with cancer. Earlier this year, Clevers revealed that he had grown 
a bank of organoids from cells extracted from colorectal tumours’, and 
David Tuveson, a cancer researcher at Cold Spring Harbor Laboratory 
in New York, worked with Clevers to generate pancreas organoids using 
biopsies taken from people with pancreatic cancer®. In both cases, the 
organoids could be used to find drugs that work best on particular 
tumours. “What patients are looking for is a logical approach to their 
cancer, Tuveson says. “I’m very excited about what we're learning” 


THE SMALL-SCALE STOMACH 


That excitement is shared by developmental 
biologist James Wells, who last year reported 
that he and his team had created an organoid 
that resembled part of a human stomach’. 

Wells started with a different raw material to 
Clevers, whose organoids arise from adult stem 
cells that can generate only a limited number of 
cell types. Wells, who is at the Cincinnati Children’s Hospital Medical 
Center in Ohio, and his colleagues craft organoids from embryonic 
stem cells, which have the ability to become almost any type of cell. Asa 
result, they have been able to create mini-organs that are more complex. 

A decade ago, Wells and his colleagues began trying to coax human 
embryonic stem cells to form intestinal cells. When the team manip- 
ulated two key signalling pathways, the layer of cells produced tiny 
round buds. Wells noticed that these ‘spheroids’ mimicked sections of 
the primitive gut tube, which forms four weeks after conception. This 
was thrilling, because he realized that he now had a starting point from 
which to develop a variety of organoids. “Every organ from your mouth 
down to your anus — oesophagus, lungs, trachea, stomach, pancreas, 
liver, intestine, bladder — all of them come from this very primitive 
tube,” he says. 

Wells and his colleagues mined the literature and their own expe- 
rience to determine what chemical cues might send these gut tubes 
down the developmental path toward a specific organ. Using this strat- 
egy, in 2011 the team developed its first human organoid’, an intestine 
about the size of a sesame seed. But growing a stomach was a bigger 
challenge. In humans, the organ has two key areas: the fundus at the 
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top, which churns out acid, and the antrum towards the base, which 
produces many key digestive hormones — and the signalling path- 
ways that lead to one versus the other were unknown. What is more, 
“the human stomach is different from the stomachs of most animals 
that we use in the lab’, so there is no good animal model, says Kyle 
McCracken, a former graduate student of Wells and now a medical 
student at the centre. 

The researchers went for a trial-and-error approach: they made some 
educated guesses and painstakingly tested different combinations of 
growth factors. Eventually, the effort paid off. In a 2014 paper’, Wells 
and his team revealed that they had created organoids that resembled 
the antrum. Using these as a model system, the team says that it has 
figured out the chemical trigger that prompts the development of a fun- 
dus. Now the researchers are working to answer other basic questions 
about stomach development and physiology, such as which factors 
regulate acid secretion, and they are trying to generate other mini- 
organs from their primitive gut tubes. 

This newfound ability to examine human development excites Daniel 
St Johnston, a developmental geneticist at the University of Cambridge's 
Gurdon Institute. “You can actually watch how the cells organize them- 
selves to make complicated structures,’ he says — something that is 
impossible in a human embryo. But most organoids are still single tis- 
sues, which limits what developmental biologists can learn, he says. 
“There are certain questions you cant really address because they 
depend upon the physiology of the whole organism.” 


THE BABY KIDNEY 


Melissa Little has spent more than a decade 
marvelling at the complexity of the kidney. “It 
has, in an adult, probably 25-30 different cell 
types, each doing different jobs, she says. Tubu- 
lar structures called nephrons filter fluid from 
the blood and produce urine. The surrounding 
space, called the interstitium, holds an intricate 
network of blood vessels and the plumbing that carries urine away. 

In 2010, Little and her colleagues started trying to turn embryonic 
stem cells into a progenitor cell that gives rise to nephrons. For three 
years, they tried various combinations and timings of growth factors. 
“Tt really took a lot of mucking around to make progress,’ she says. But 
finally, in 2013, the team landed on just the right mixture. Little had 
been aiming to produce just the progenitor cells. But when she looked 
in the dish she saw two cell types spontaneously patterning themselves 
as they would in an embryo. “There was a moment of, ‘Oh wow. Isn’t 
that amazing,’ she says. 

This organoid resembles an embryonic kidney rather than an adult 
one: it has a mix of nephron progenitors and the cells that give rise to 
urine-collecting ducts’. “If you want to get them to mature further, 
that’s where the challenge really lies,’ Little says. So her team has been 
working to grow a more-sophisticated version — with blood vessels 
and interstitium. The hope then is to transplant the mini-organs into 
mice to see if they will mature and produce urine. “I'm pretty excited 
about what we can build,” Little says. 

Because the kidney plays a key part in drug metabolism and excre- 
tion, Little thinks that her mini-kidneys could be useful for testing drug 
candidates for toxicity before they reach clinical trials. And researchers 
say that other human organoids, such as heart and liver, could similarly 
be used to screen drug candidates for toxic effects — offering a better 
read-out on the response of an organ than is possible with standard 
tissue culture or animal testing. 

But Michael Shen, a stem-cell researcher at Columbia University in 
New York who has created a prostate organoid, is sceptical that these 
model systems could completely replace lab animals. Animals can show 
how a therapy affects the immune system, for example, something that 
organoid systems cannot currently do. “You want to be able to validate 
your experimental findings in an in vivo system, he says. “I view that 
as a rigorous test.” 
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LITTLE LIVERS 


Takanori Takebe was inspired to grow a liver after 
a chilling spell in New York. While working in 
the organ-transplantation division at Columbia 
University in 2010, Takebe saw people die from 
liver failure owing to a lack of organs. “That was 
asad situation,” he says. When he looked into tis- 
sue engineering, he thought that the usual meth- 
ods — seeding cells onto an artificial scaffold — seemed destined to 
fail. Part of the problem, he says, is that adult liver cells are very difficult 
to grow. “We cannot maintain it in culture for even a couple of hours.” 

Takebe, who took up a research position at Yokohama City University 
in Japan, decided to work on induced pluripotent stem (iPS) cells, adult 
cells that have been reprogrammed to behave like embryonic stem cells. 
He coaxed human iPS cells into forming liver-cell precursors, or hepato- 
blasts. In the embryo, hepatoblasts rely on a complex symphony of signals 
from other nearby cells to mature, and Takebe suspected that these sup- 
port cells would also be necessary to develop a liver in a dish. He and his 
colleagues mixed hepatoblasts with such cells — called mesenchymal and 
endothelial cells — and it worked. The team managed to create ‘liver buds, 
structures no bigger than a lentil that resemble the liver ofa six-week-old 
human embryo”. The researchers went on to find that, unlike mature 
liver cells, such structures can survive in culture for as long as two months. 

A liver bud is still a far cry from an entire liver — a hefty, multi-lobed 
organ composed of tens of billions of hepatocytes. But Takebe hopes that 
ifhe can infuse many thousands of buds into a failing organ, he might be 
able to rescue enough of its function to make a transplant unnecessary. 
The process seems to work in mice. When Takebe and his group trans- 
planted a dozen of the buds into mouse abdomens, they saw dramatic 
effects. Within two days, the buds had connected up with the mouse’s 
blood supply, and the cells went on to develop into mature liver cells that 
were able to make liver-specific proteins and to metabolize drugs. To 
mimic liver failure, the team wiped out the animals’ natural liver function 
with a toxic drug. After a month, most of the control mice had died, but 
most of those that received liver bud transplants had survived. 

Takebe and his team hope to start human trials in four years. “We will 
target the children that critically need a liver transplant,’ he says. He 
and his colleagues are currently working to make the liver buds smaller 
and produce them in huge quantities that they can infuse through the 
large portal vein that feeds the liver. Takebe thinks that the timeline is 
“doable” But Smith says that the process seems rushed, and that the basic 
biology of these organs needs to be well understood before they are used 
in the clinic. “It’s like running before you can walk,” he says. 

Biologists know that their mini-organs are still a crude mimic of their 
life-sized counterparts. But that gives them something to aim for, says 
Anthony Atala, director of the Wake Forest Institute for Regenerative 
Medicine in Winston-Salem, North Carolina. “The long-term goal is that 
you will be able to replicate more and more of the functionality ofa human 
organ.” Already, the field has brought together developmental biologists, 
stem-cell biologists and clinical scientists. Now the aim is to build more- 
elaborate organs — ones that are larger and that integrate more cell types. 

And Wells says that even today’s rudimentary organoids are facilitating 
discoveries that would have been difficult to make in an animal model, 
in which the molecular signals are hard to manipulate. “In a Petri dish 
it's easy,’ he says. “We have chemicals and proteins that we can just dump 
onto these cells.” = 


Cassandra Willyard is a science writer based in Madison, Wisconsin. 
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Members of the US Army attend a physics lecture at the Massachusetts Institute of Technology in 1944. 


From blackboards 
to bombs 


Seventy years after the destruction of Hiroshima 
and Nagasaki by nuclear weapons, David Kaiser 
investigates the legacy of ‘the physicists’ war’. 


1945, mushroom clouds erupted sky- 
ward above the smouldering cities of 
Hiroshima and Nagasaki, Japan. For the 
first time — and, so far, the only time — 
nuclear weapons had been used in combat. 
Hundreds of thousands of people perished. 


Oe years ago, on 6 and 9 August 


Many died from the immediate force and fire 
of the blasts; others succumbed later to acute 
radiation sickness. Days after the bombs 
were dropped, Japan surrendered and the 
Second World War lumbered to a close. 
The Second World War marked an 
unprecedented mobilization of scientists 
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and engineers, and a turning point in the 
relationship between research and the state. 
By the end of the war, the nuclear weapons 
project, code-named the Manhattan Engineer 
District, absorbed thousands of researchers 
and billions of dollars. It sprawled across 
30 facilities throughout the United States and 
Canada, with British teams working alongside 
Americans and Canadians. Allied efforts on 
radar swelled to comparable scale. 

The drama with which the war ended — 
the detonation of nuclear weapons over cities 
— cemented the association of the Second 
World War as ‘the physicists’ war’ Yet the 
term had been coined long before August 
1945, and originally it had nothing to do 
with bombs or radar. Rather, the physicists’ 
war had referred to an urgent, ambitious 
training mission: to teach elementary phys- 
ics to as many enlisted men as possible. 

Both views of how scientists could serve 
their nations — the quotidian and the cata- 
clysmic — have shaped scientific research 
and higher education to this day. 


CATCHY PHRASE 
In late November 1941, just weeks before 
the United States entered the global con- 
flict, James Conant explained in a newslet- 
ter of the American Chemical Society that 
“this is a physicist’s war rather than a chem- 
ist’s”’. Conant was well-placed to know: 
he was president of Harvard University in 
Cambridge, Massachusetts, chair of the 
US National Defense Research Committee 
(NDRC), and a veteran of earlier chemical- 
weapons projects. 

The phrase had instant appeal; others 
quickly began to quote it. In 1949, for 
example, Life magazine profiled’ physicist 
J. Robert Oppenheimer, who had served 
as scientific director of the wartime Los 
Alamos laboratory in New Mexico, a central 
node of the Manhattan Project. Referring to 
massive military projects such as the bomb 
and radar, the reporter invoked “the popular 
notion” that the Second World War had been 
“a physicists’ war”. 

By that time, the meaning of Conant’s 
formulation seemed self-evident. The First 
World War, with its notorious battlefield 
uses of poison gases such as phosgene and 
chlorine, had been dubbed the chemists’ 
war. The bomb and radar presented a logi- 
cal counterpoint. 
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> But Conant had very different ideas 
when he introduced the now-famous 
phrase. It was hardly clear in November 
1941 that either the bomb or radar would 
change the tide of the war. The Radiation 
Laboratory, or ‘Rad Lab, at the Massa- 
chusetts Institute of Technology (MIT) in 
Cambridge — which served as headquar- 
ters for the Allied effort to improve radar 
— was just one year old. A prototype radar 
system had recently been rejected by a US 
Army review board, 
and NDRC fund- 


“Physics 
ing had nearly been ee 
revoked. The Man- heramsa 
hattan Project did not ° 

rationed 


exist yet. Los Alamos 
still housed a private 
boys’ school. A year 
and a half would elapse before the mud- 
caked ranch houses were requisitioned for 
the top-secret laboratory. 

There is also the matter of secrecy. Conant 
oversaw both radar development and the 
nascent nuclear-weapons programme; infor- 
mation about each was strictly classified. 
An experienced, high-ranking government 
adviser such as Conant surely did not intend 
to disclose some of the nation’s most closely 
guarded secrets. 

And there is the nature of the radar and 
bomb projects themselves. Although each 
was directed by physicists, they teemed with 
specialists of all stripes. By the end of the war, 
physicists were a minority — only about one- 
fifth — of the Rad Lab staff. At Los Alamos, 
the wartime organization chart displayed the 
groups — metallurgy, chemistry, ballistics, 
ordnance and electrical engineering, as well 
as physics — arranged ina circle, connected 
by spindly links. No group appeared on top 
directing the others. Researchers at both the 
Rad Lab and Los Alamos forged new kinds 
of hybrid, interdisciplinary spaces during the 
war. Neither could be categorized simply as 
a physics laboratory’ (see D. Kaiser Nature 
505, 153-155; 2014). 

So what was Conant talking about? 


commodity.” 


CLASSROOM MOBILIZATION 

To most scientists and policy-makers in the 
early 1940s, the physicists’ war referred to a 
massive educational mission. 

In January 1942, the director of the Ameri- 
can Institute of Physics (AIP), Henry Barton, 
citing Conant, began issuing bulletins enti- 
tled ‘A Physicist’s War’. Barton reasoned that 
“the conditions under which physicists can 
render services to their country are chang- 
ing so rapidly” that department chairs and 
heads of laboratories needed some means 
of keeping abreast of evolving policies and 
priorities. The monthly bulletins focused on 
two main topics: how to secure draft defer- 
ments for physics students and personnel, 
and how academic departments could 
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meet the sudden demand for more physics 
instruction. 

Modern warfare, it seemed, required 
rudimentary knowledge of optics and 
acoustics, radio and circuits. Before the war, 
the US Army and Navy had trained techni- 
cal specialists from within their own ranks, 
at their own facilities. The sudden entry 
of the United States into the war required 
new tactics. University physicists, consult- 
ing with army and navy officials, reported 
early in the conflict that enrolments in high- 
school physics classes would need to jump 
by 250%. Their goal: half of all high-school 
boys in the country should spend at least 
one class per day focusing on electricity, 
circuits and radio’. 

The navy and the army also called for 
massive numbers of military personnel to 
receive basic training in physics at estab- 
lished colleges and universities. Draft curric- 
ula circulated between military officials and 
the AIP. The army, for example, wanted the 
new courses to emphasize how to measure 
lengths, angles, air temperature, barometric 
pressure, relative humidity, electric current 
and voltage. Lessons in geometrical optics 
would emphasize applications to battlefield 
scopes; lessons in acoustics would drop 
examples from music in favour of depth 
sounding and sound ranging. 

So acute was the need to teach elemen- 
tary physics that a special committee 
recommended that university departments 
discontinue courses in atomic and nuclear 
physics for the duration of the war so as 
to devote more teaching resources to truly 
“essential” material”. 

Between December 1942 and August 
1945, a quarter of a million students passed 
through elementary physics classes at 


BRANDING EXERCISE 


US colleges and universities. Staffing the 
inflated classrooms required military-style 
planning and logistics. Barton’s bulletins 
warned that any universities found to be 
hoarding valuable physics teachers — much 
less poaching them from other schools — 
would be subject to “severe criticism”. Barton 
developed a complicated formula for what 
he termed the acceptable “ratio of genuine to 
‘ersatz’ teachers of physics” in any given insti- 
tution. Physics teachers became a rationed 
commodity: like rubber, gasoline and sugar, 
they were in critically short supply. 

Draft policies quickly followed. The 
US government created a National Commit- 
tee on Physicists in December 1942 — the 
first of its kind for any academic speciality 
— to advise local draft boards on the need 
for teaching-related deferments. Soon the 
phrase the physicists’ war echoed through- 
out newspapers, popular magazines and 
even congressional testimony. Use of the 
phrase peaked in 1943, long before there 
was much to report (classified or otherwise) 
about the Manhattan Project (see ‘Branding 
exercise’). 


SECRECY AND THE SMYTH REPORT 
Use of the phrase the physicists’ war 
rebounded every decade or so, usually 
around an anniversary of the bombings of 
Hiroshima and Nagasaki. The highest post- 
war peak accompanied the publication of 
Richard Rhodes’s Pulitzer-prizewinning 
book, The Making of the Atomic Bomb, in 
1986. By then, Conant’s phrase had long 
since been linked with classified military 
projects rather than classroom instruction. 
The transition began almost as soon as 
the bombs were dropped on Japan. Gen- 
eral Leslie Groves, who was overseeing the 


The phrase ‘the physicists’ war’, coined to call for mass education as the United States 
entered the Second World War, took on a new life after August 1945. 
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Manhattan Project, anticipated that the gov- 
ernment would need to have some informa- 
tion ready to release about the top-secret 
nuclear-weapons project — pre-cleared and 
available for wide distribution — in case the 
bombs were ever used. Early in the project, 
he tapped nuclear physicist Henry DeWolf 
Smyth at Princeton University in New Jer- 
sey to spend the war visiting each Manhattan 
Project site, compiling a technical report that 
would be suitable for public dissemination. 

On the evening of 11 August 1945, just two 
days after the bombing of Nagasaki, the US 
government released Smyth's 200-page docu- 
ment under the ponderous title, ‘A General 
Account of Methods of Using Atomic Energy 
for Military Purposes under the Auspices of 
the United States Government, 1940-1945" 
Quickly dubbed ‘the Smyth report, cop- 
ies flew off the shelves. The original Gov- 
ernment Printing Office edition ran out 
so quickly that Princeton University Press 
published its own edition late in 1945, under 
the more manageable title, Atomic Energy 
for Military Purposes, which sold more than 
100,000 copies in a year. 

Security considerations dominated what 
Smyth could include. Only information 
that was already widely known to working 
scientists and engineers, or which had “no 
real bearing on the production of atomic 
bombs’, was deemed fit for release. Little of 
the messy combination of chemistry, metal- 
lurgy, engineering or industrial-scale man- 
ufacturing met these criteria; these aspects 
of the huge project, crucial to the actual 
design and production of nuclear weapons, 


Reserve Officer Training Corps students use a portable radio as part of their training at MIT (about 1944). 
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remained closely guarded’. 

So Smyth focused on ideas from physics, 
pushing theoretical physics, in particular, to 
the forefront. Ironically, most people read 
in Smyth's report the lesson that physicists 
had built the bomb (and, by implication, had 
won the war)®”. Later reports, such as ‘Essen- 
tial Information on Atomic Energy, issued 
in 1946 by the new Special Committee on 
Atomic Energy of the US Senate, borrowed 
liberally from the Smyth report, depicting 
nuclear weapons as the latest in a series of 
developments in theoretical physics. A 
chronological table at the end extended the 
narrative as far back as 400 Bc to the ancient 
Greek atomists. There was little mention of 
the Berlin chemistry laboratory in which 
nuclear fission had been discovered late in 
1939, much less the work of chemical engi- 
neers at US company DuPont, who scaled 
up plutonium-producing nuclear reactors 
during the war. 


LONG SHADOW 

The change in referent for the physicists’ 
war — from blackboards to bombs — 
had serious implications. After the war, 
physicists in the United States bore the 
largest brunt of any academic group dur- 
ing the ‘red scare’ of the 1950s, promoted 
by Senator Joseph McCarthy. The House 
Un-American Activities Committee held 
27 hearings on allegations against physi- 
cists, twice the number for any other schol- 
arly discipline. If nuclear weapons had been 
made by physicists, the reasoning went, 
then physicists must have special access to 
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the ‘atomic secrets’ with which such bombs 
could be made. Thus the loyalties of this 
group required the closest scrutiny’. 

Meanwhile, the two meanings of the 
physicists’ war blurred together as the cold 
war intensified. More and more universities 
became contracting sites for military and 
defence agencies, continuing the model that 
Conant and others had forged during the 
war. Physicists’ research budgets ballooned, 
and enrolments grew faster than in any other 
field, doubling every few years. 

More physicists were trained in the 
United States, United Kingdom and Soviet 
Union in the quarter-century after the war 
than had been trained throughout all of pre- 
vious history. Yet the aims of the training 
shifted in the 1950s and 1960s. Rather than 
teaching soldiers some elementary phys- 
ics to prepare them for the battlefield, US 
officials spoke of creating a ‘standing army’ 
of physicists, who could work on nuclear- 
weapons projects without delay should the 
cold war ever turn hot’. 

Three decades after 1945, years into 
the slog of the Vietnam War, many crit- 
ics grew uneasy with the close association 
between physics and war. Campus protesters 
demanded that the defence department get 
out of the higher-education business. At uni- 
versities across the United States, physicists’ 
laboratories became frequent targets for sit- 
ins and even Molotov cocktails”. 

After the protesters dispersed and the tear 
gas lifted, several things had become clear. 
“The physicists’ war’ had massively altered 
the structure of the US university system, the 
organization of scientific research, and the 
relationship between national defence and 
higher education. = 
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Mountain pine beetles have destroyed hundreds of thousands of hectares of forest in Canada. 


Define biomass sustainability 


The future of the bioeconomy requires global agreement on metrics and the creation 
of a dispute resolution centre, say Roeland Bosch, Mattheiis van de Pol and Jim Philp. 


r | Ahe bioeconomy is rising up the 
political agenda. More than 30 coun- 
tries have announced that they will 

boost production of renewable resources 
from biological materials and convert them 
into products such as food, animal feed and 
bioenergy. Non-food crops, such as switch- 
grass (Panicum virgatum), are the main 
focus, as well as agricultural and forestry 
residues and waste materials and gases. 

It is one thing to write a report; it is 
another to puta plan into action sustainably. 
The biggest conundrum is reconciling the 
conflicting needs of agriculture and indus- 
try. Ina post-fossil-fuel world, an increasing 
proportion of chemicals, plastics, textiles, 
fuels and electricity will have to come from 
biomass, which takes up land. By 2050, the 
world will also need to produce 50-70% 
more food’, increasingly under drought 
conditions and on poor soils. 

There is no consensus on what 
‘sustainable’ means. Biomass assessment 
is a patchwork of voluntary standards and 
regulations. With many schemes comes a 
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lack of comparability. Confusion leads to 
mistrust and protectionism, international 
disputes and barriers, slow investment and 
slower growth. 

For example, greater use of wood for elec- 
tricity generation or heating may decrease 
greenhouse-gas emissions if it displaces 
coal. But retaining forests also sequesters 
carbon and protects biodiversity. Increased 
demand boosts wood-pellet prices, and 
puts pressure on businesses, such as saw 
mills, that use wood. The balance of who 
saves or creates emissions shifts when bio- 
mass is exported. 

The geopolitical implications mirror 
those of crude oil. Developed countries 
that lack fossil fuels are thirsty for renew- 
able energies. Some developing countries 
may be tempted to meet that demand 
without accounting for the environmental 
or social cost. It is in everyone's interests 
to harmonize sustainability standards and 
head off disputes before they arise. Govern- 
ments should agree on criteria and define 
metrics for assessing biomass sustainability. 
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And they should consider creating a centre 
for resolving disputes that arise over 
competition for land and biomass. 


NO CONSENSUS 

In 2012, the United States and the European 
Union laid down their intentions to grow 
their bioeconomies””’. Now, the G7 indus- 
trialized countries* and at least 20 others 
either have a dedicated bioeconomy strat- 
egy in place (including Finland, Malaysia 
and South Africa) or have policies consist- 
ent with growing a bioeconomy (including 
Australia, Brazil, China, India and Russia). 

The bioeconomy of Malaysia, for example, 
is expected to grow by 15% per year to 2030. 
The palm-oil industry is central to that plan, 
as it is elsewhere in southeast Asia. 

Making up 45% of the world’s edible oil, 
palm oil can also be processed into biodiesel. 
The oil-palm crop is also more effective at 
sequestering carbon than other major crops. 
Using genomics in selective breeding offers 
great potential for improvements to the eco- 
nomics of palm-oil production’. 


UDO WEITZ/BLOOMBERG/GETTY 


Land disputes between palm-oil compa- 
nies and local communities have already 
begun. Between 2006 and 2010, Indonesia's 
palm-oil plantation area increased dramati- 
cally, from 4.1 million to 7.2 million hectares. 
The increase has been accompanied bya rise 
in deforestation, water pollution, soil erosion 
and air pollution, as well as restrictions on 
traditional land-use rights and land losses, 
increasing land scarcity and land prices®. 

A situation that arose between Canada 
and the EU in 2012 illustrates how rational 
decision-making in different countries can 
lead to disputes. The EU’s Renewable Energy 
Directive sustainability criteria for biofuels 
and bioliquids are non-binding for solid 
biomass. EU biomass sustainability stand- 
ards also prohibit the use of ‘primary forest’ 
materials for bioenergy. 

In Canada, forests are deemed sustainable 
by measures of woodland structure, compo- 
sition and degree of ‘naturalness. Overall, 
the area affected by natural disturbances 
such as insect infestations and wildfire is 
larger than the total area of logging — and 
the use of such damaged trees for bioenergy 
holds potential. But because it stems from 
primary forest that has not been harvested 
or regrown, such wood would be excluded 
from importation into the EU. 

A dispute arose in 2012 between an 
environmental organization and an energy 
company wishing to ship wood pellets to 
Europe that had originated from Canadian 
primary forest infested with the mountain 
pine beetle (Dendroctonus ponderosae). 
The Dutch government used the case to 
see whether mediation might work in such 
circumstances — and it did. The dispute 
was settled. 

Increasing demand for biomass makes it 
likely that such disputes will recur. Limited 
land mean that Europe cannot grow enough 
biomass to meet its own future demand. 
Depending on bioenergy policies, biomass 
use is expected to continue to rise to 2030 
and imports to Europe are estimated to tri- 
ple by 2020. Wood-pellet use for large-scale 
power generation is increasing dramati- 
cally in Europe. Some countries including 
Germany and Denmark have become net 
importers. Europe may import’ 80 million 
tonnes of solid biomass per year by 2020. 

Today’s biomass situation bears similarities 
to that in the 1980s, when a system of national 
agricultural subsidies in Europe threatened to 
start trade wars. Policies directed at produc- 
ing more food combined with rapid technical 
progress and structural changes led to agri- 
cultural trade barriers. Domestic surpluses 
of farm goods were stocked or exported with 
subsidies — giving rise to the European ‘but- 
ter mountains’ and ‘wine lakes’ — by protect- 
ing farm producers at the cost of domestic 
consumers and producers abroad. The costs 
weighed heavily on government budgets. 


Consumers in countries with protected 
markets faced higher food bills, and pro- 
ducers in other countries were penalized by 
restrictions on access to those markets*. 

The Organisation for Economic Co-oper- 
ation and Development (OECD) helped to 
resolve that situation by developing stand- 
ards for agricultural subsidies, which are 
accepted globally. An analogous, inter- 
nationally agreed biomass sustainability 
governance framework is now needed. 


SUSTAINABILITY METRIC 

A metric for evaluating biomass sustain- 
ability needs to be designed. Social as well 
as environmental and economic factors 
must be included. As yet there is no con- 
sensus on what criteria should be used. 
For example, international stakeholders 
(non-governmental organizations, policy- 
makers, research and development, bioen- 
ergy producers, end-users and traders) from 
25 European and 9 non-European countries 
surveyed in 2011 agreed’ unanimously on 
only one criterion — minimization of green- 
house-gas emissions. 

Aggregation of sustainability issues into a 
single measure requires complicated trade- 
offs between, say, kilograms of carbon 
dioxide emissions and labour conditions. 
Practitioners’ own weightings are subjective. 
Life-cycle analysis — a technique to assess 
environmental impacts of all the stages 
of the manufacture, use and disposal of a 
product — does not look at social impacts. 
To define an index, multiple variables must 
be expressed using a common denominator. 

Using price information is understood by 
policy-makers and the market. But placing 
monetary values on social and ethical costs 
and benefits is contentious. Differences 
between developed and developing coun- 
tries require careful handling — for exam- 
ple, the different reactions to placing a cost 
on child labour. 

A starting point could be the total factor 
productivity (TFP) metric used to measure 
agricultural sustainability. The TFP reflects 
the rate of transformation of inputs (capital, 
labour, materials, energy and services) into 
outputs (biomass stock). A cost is attrib- 
uted to each and to the negative social and 
economic impacts. 

To develop the TFP approach into an 
integrated methodology for assessing bio- 
mass sustainability, it should take account of 
changing conditions and local situations in 
biomass importing and exporting countries. 
For example, just like oil, the price of biomass 
will fluctuate. Expensive producers would 
be hurt. Sustainability assessment will need 
to take account of such fluctuations. A tenet 
of a bioeconomy is decentralized feedstock 
access — biomass-sustainability assessment 
needs the flexibility to take such matters into 
account. 
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Biomass metrics could be aligned with 
the United Nations’ Sustainable Develop- 
ment Goals, the indicators!’ of which are 
similar. A step towards this alignment may 
be achieved in November at the Global Bio- 
economy Summit in Berlin. This meeting of 
more than 500 leaders from policy, research, 
industry and civil society organized by the 
German Bioeconomy Council (an inde- 
pendent advisory body to the German gov- 
ernment) could result in recommendations 
around global governance and international 
cooperation. 

Beyond that, international agreement is 
needed on the key biomass sustainability 
criteria. The OECD could host initial dis- 

cussions including 


“Diff erences exploring the setting 
between up of an international 
developedand biomass dispute 
developing settlement facility. 
countries Developing countries 
require and biomass produc- 
careful ers outside the OECD 


should also be repre- 
sented at such talks. 
These possibilities will be discussed at the 
OECD Committee for Scientific and Tech- 
nological Policy ministerial-level meeting in 
October hosted by the South Korean govern- 
ment in Daejeon. = 


handling.” 
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Many ways to access 
hominin fossil finds 


Michael Cherry's perspective of 
elitist tourism at South Africa's 
Cradle of Humankind World 
Heritage Site in Gauteng does not 
take into account extensive efforts 
by scientists, the government 
and the private sector to bring a 
wider understanding of human 
evolution and our shared heritage 
to the South African public 
(Nature 523, 33; 2015). 
Excavations of hominin fossils 
at the Malapa caves, viewed from 
an overhead structure (pictured), 
may need wealthy tourists to 
support access to the site, but 
plans include free school tours 
on at least one day each month. 
The Rising Star cave will be open 
as a nature and heritage reserve 
for an entry fee of just a few rands 
(12 rands is US$1). A virtual lab at 
the Maropeng visitor centre will 
also allow people to watch Malapa 
fossils being prepared online. 
Last year, 38,000 schoolchildren 
visited our education facilities 
at the active palaeontological 
site of Sterkfontein. The 
outreach programmes at the 
University of the Witwatersrand’s 
Evolutionary Studies Institute, 
together with privately funded 
activities, reach about 200,000 
more schoolchildren, most from 
disadvantaged backgrounds. 
Public exhibitions of our 
fossil hominin treasures are held 
frequently at museums around 
South Africa, with up to 15 on 
display at any one time. The 
government and the University 
of the Witwatersrand have also 
gifted casts of Australopithecus 
sediba fossils to museums across 
Africa and around the world. 
Lee R. Berger University of the 
Witwatersrand, South Africa. 
lee. berger@wits.ac.za 


Initiatives to bridge 
faith and science 


Two initiatives aim to increase 
awareness and acceptance 

of science by US Christian 
communities, some of which 


resist science-education efforts. 


BioLogos (http://biologos. 
org) was founded in 2007 by 
Francis Collins, then leader of 
the Human Genome Project, to 
encourage other Christians to 
accept evolution in the context 
of their faith. Trust and respect 
for Collins has been key to its 
success. Its grant programme 
has so far disbursed a total of 
US$3.9 million to 37 faith- 
science partnerships. 

In fostering dialogue between 
theologians and scientists who 
are Christians, BioLogos is 
forging a middle ground between 
presentations of science that are 
antagonistic towards faith and 
faith that will not accommodate 
science. Last month, for 
example, a BioLogos conference 
of scientists, theologians and 
pastors helped to articulate 
the overlap between theology 
and evolutionary theory (see 
go.nature.com/ovnpwb). 

A programme by the 
American Association for 
the Advancement of Science 
takes a different approach, in 
partnership with the Association 
of Theological Schools (www. 
scienceforseminaries.org). Their 
pilot project has helped ten 
seminaries since 2013 to integrate 
science into the curriculum for 
training religious leaders. One 
seminary includes evolutionary 
biology in courses on biblical 
interpretation; others teach 
neuroscience, genetics or ecology 
within explorations of identity 
and environmental stewardship. 


Science education is a public 
good that we as scientists should 
help to reinforce across all faiths 
with partnerships such as these. 
S. Joshua Swamidass 
Washington University in 
St. Louis, Missouri, USA. 
swamidass@wustl.edu 


Tree rings track 
climate trade-offs 


The information held in annual 
tree rings offers further insight 
into the potential of the world’s 
forests to slow global warming 
(see Nature 523, 20-22; 2015). 
These data reveal fluctuations in 
growth rate caused by climatic, 
physiological and ecological 
factors, and provide long-term 
and spatially extensive records. 

Tree-ring measurements 
provide information on 
functional trade-offs that can 
affect a tree’s future, such as 
altered hydraulic and growth 
responses to the long-term rise 
in atmospheric carbon dioxide 
(see P. van der Sleen et al. Nature 
Geosci. 8, 24—28; 2015). And 
unlike data from remote sensing 
of forests and monitoring of tree 
diameters, tree-ring observations 
extend over centuries. 

Such studies also mitigate 
the problems you mention of 
investigating small forest patches, 
because the size and number of 
research areas can be adjusted. 

A relatively small percentage of 
ring-forming species in tropical 
forests and the fading record of 


tree-ring collections further back 
in time are impediments. Also, 
most existing tree-ring collections 
are not representative of trees that 
lived in the past (E. Babst et al. 
Oecologia 176, 307-322; 2014). 
We suggest that combining 
retrospective tree-ring analysis, 
repeated forest inventory 
measurements and modelling 
studies could more effectively 
predict responses of the world’s 
forests to climate change. 
Pieter A. Zuidema Wageningen 
University, the Netherlands. 
David Frank Swiss Federal 
Research Institute WSL, 
Birmensdorf, Switzerland. 
david.frank@wsl.ch 


Universities aim for 
a sustainable future 


A survey of several European 
universities shows that corporate 
responsibility for the economic, 
social and environmental effects 
of commercial activities is slowly 
spreading to the non-profit 
sector in Europe — including to 
institutions of higher education. 

The survey, undertaken in 
2014, polled 73 members of the 
European University Association, 
representing 18 countries across a 
range of institutional age, size and 
ranking (see Y. Fassin in Proc. 
26th Int. Assoc. Bus. Soc.; in the 
press). Of those 73 universities 
(response rate 11%), 82% had 
incorporated sustainability and 
the European Commission’s 
corporate social responsibility 
recommendations into their 
development strategies; 70% 
were already reporting on their 
initiatives. 

The survey revealed that these 
universities tend to emphasize 
sustainable development over 
social responsibility. Perhaps 
because social responsibility 
isa given for universities, 
sustainability for future 
generations of students seems to 
hold more sway with university 
management. 

Yves Fassin Ghent University, 
Belgium. 
yves.fassin@ugent. be 


30 JULY 2015 | VOL 523 | NATURE | 531 


© 2015 Macmillan Publishers Limited. All rights reserved 


DARREN ALEXANDER 


NEWS & VIEWS 


For News & Views online, go to 
nature.com/newsandviews 


SUSTAINABILITY 


Bypassing the methane cycle 


A genetically modified rice with more starch in its grains also provides fewer nutrients for methane-producing soil 
microbes. This dual benefit might help to meet the urgent need for globally sustainable food production. SEE LETTER P.602 


PAUL L. E. BODELIER 


espite being much less abundant in 
D the atmosphere than carbon dioxide, 

methane has a higher capacity to 
absorb heat emitted from Earth's surface, and 
thereby contributes substantially to global 
warming'. The demonstration’ that reduc- 
ing emissions of this greenhouse gas can 
be achieved more easily and more rapidly 
than for CO, has spurred an avalanche of 
studies on possible methane-mitigation 
strategies. Rice cultivation is the largest 
single source of methane linked to human 
activity, and methane emissions from rice 
paddies are expected to increase with a 
rising global demand for food. On page 602 
of this issue, Su et al.’ describe a ‘high-starch, 
low-methane’ rice variety that represents a 
tremendous opportunity for more-sustainable 
rice cultivation. 

In rice, the leaves and stems take up CO,, 
which is transformed through photosynthesis 
into sugars that are used to produce plant bio- 
mass or storage compounds, such as starch, in 
the shoots, roots and rice grains. Carbon from 
decaying roots or that is directly released by 
roots into the soil in the form of sugars, amino 
acids and organic acids, can be transformed by 
decomposer microorganisms into substrates 
(CO,, hydrogen and acetate). In the absence 
of oxygen, these substrates are turned into 
methane by methanogenic microorganisms. 
The methane can remain in the soil, escape 
to the atmosphere through diffusion into and 
emission by the rice plants, or be intercepted by 
methane-consuming bacteria in the root zone 
or the soil’s surface layer. These bacteria use 
methane as a substrate for oxygen-requiring 
respiration, producing CO. Hence, methane 
emission from rice soils is determined by the 
balance of methane-producing and methane- 
oxidizing microbes, the availability of other 
substrates, and the activity of microbes compet- 
ing for these compounds — collectively, these 
processes constitute the methane cycle (Fig. 1). 

Existing efforts to mitigate rice-associated 
methane emissions have focused mainly* on 
agricultural practices — such as water man- 
agement, fertilizer use, tillage and crop selec- 
tion — that alter the environmental conditions 
for methanogenic microorganisms. However, 
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Figure 1 | High-starch rice in the environment. The transgenic SUSIBA2 rice described by Su et al.’ 
produces grains with a high starch content by diverting more carbon (derived from photosynthesis) into 
grains and stems, and less into roots (red arrows). This results in less carbon being input into the soil by the 
plants and thus being available to decomposer microorganisms that supply carbon-containing substrates to 
methane-producing microbes. These effects combine to dramatically reduce methane emission from areas 
planted with this rice strain. However, the amount of methane emitted depends on a complex interplay 
between plant physiology and the activity of these and other microorganisms, including competitors for 
substrates and methane consumers. These interactions can all be influenced by the amount and type of 
carbon compounds and nutrients available, as well as by the amount of oxygen expelled by the roots. The 
effects of SUSIBA2 rice on many aspects of this cycle are not yet clear (question marks). 


these measures are labour intensive and their 
applicability varies between rice-cropping sys- 
tems and between countries. In 2002, it was 
observed’ that the larger the amount of grain 
carried by the rice plants, the less methane is 
emitted, because carbon fixed in the grains 
does not become available for soil microbes to 
turn into methane. This observation suggested 
a globally applicable methane-mitigation 
solution: produce a rice plant with a higher 
proportion of its carbon in the stems and 
grains than current varieties. Crops of this 
plant would not only result in less methane 
emissions, but also give higher grain yield and 
higher nutritional value. But it has taken until 
now for such a plant to be generated. Su et al. 
present evidence from field and greenhouse 
trials of varieties of transgenic rice called 
SUSIBA2 that fulfil these criteria. 

The authors generated their rice varieties 
by transferring genes from barley that are 
responsible for the production of starch in 
stems and grains; in rice, these genes lead to 
higher starch production, and hence a higher 
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demand for sugars, in these plant parts. The 
transgenic rice displayed the expected prop- 
erties of higher seed weight and higher starch 
content in seeds and stems, but no change in 
starch levels in leaves and roots and a mark- 
edly lower root biomass. By screening the 
expression of a large set of genes involved in 
converting sugar into starch, the authors con- 
firmed that the inserted genetic elements were 
effective only in the seeds and stems. 

The ‘cherry on top’ of these genetic efforts 
was a significant reduction in methane emit- 
ted from SUSIBA2 rice plants, compared 
with a widely grown unmodified variety. The 
methane was measured by covering the plants 
with transparent gas-collection chambers. 
A decrease in methane emission was seen in 
trials of two variants of SUSIBA2 rice in three 
regions of China, measured in three consecu- 
tive growing seasons. The authors also found 
fewer methanogenic microorganisms on and 
around the roots of SUSIBA2 rice, suggesting 
that there was less plant-derived carbon-con- 
taining substrate available for methanogens. 


Although Su and colleagues have made the 
groundbreaking demonstration that high- 
starch, low-methane rice plants can be gener- 
ated, their study raises many issues. The most 
obvious is that SUSIBA2 rice is a transgenic 
plant, and thus raises biological and ethics 
concerns. In addition to the general questions 
surrounding the use of genetically modified 
crops for human consumption, and how access 
to seed for such crops is controlled, we do not 
yet have a clear picture of how this modifica- 
tion affects rice plants’ survival and general 
function. 

Long-term and frequent measurements of 
methane emissions from areas planted with 
normal and transgenic rice are needed to 
estimate what the annual global effect of the 
widespread use of this crop would be, and how 
it compares with that of other methane-miti- 
gation strategies. Even more important will be 
assessment of the long-term consequences of 
lower carbon and oxygen input by the roots 
of SUSIBA2 plants on soil processes and the 
microbes that carry them out (Fig. 1). It has 
recently been shown’ that highly specific 
assemblages of microbial species occur in, on 
and around rice roots, and that not all mem- 
bers use plant-exuded carbon’. Long-term 
reduction of root-exuded carbon might alter 
the composition of these communities, with 
unknown consequences for microbes that are 
plant pathogens or that benefit the plants, such 
as the bacteria that decompose organic mate- 
rial and deliver essential plant nutrients*. 

To compensate for the possible reduction 
in plant nutrients, larger amounts of nitrogen 
fertilizer would need to be applied. This can 
affect both methane producers and consum- 
ers’ and lead to undesirable environmental 
effects, such as nitrate leaching to ground- 
water and emission of the potent greenhouse 
gas nitrous oxide. Also crucial for the amount 
of methane emitted is the activity of methane- 
consuming aerobic bacteria. The oxygen they 
use flows though the plant stems and roots into 
the soil by the same route taken by methane 
moving out of the soil into the atmosphere, and 
it is not known how the transport of gases is 
affected in the transgenic rice. 

Thus, translocating more carbon to the 
stems and seeds of SUSIBA2 rice may bypass 
methane cycling, but this activity has the 
potential to affect a multitude of processes 
involving soil carbon, nutrients and microbial 
activity, with knock-on effects for the sustain- 
ability of rice cultivation. However, Su and 
colleagues have achieved the feat of making 
high-starch rice available, and this will spur 
scientists worldwide to conduct experiments 
to verify whether this variety will enable more- 
sustainable cultivation of the crop that feeds 
half the human population. = 
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Movies of a growth 


mechanism 


A microscopy technique has been used to study the formation and growth of 
crystals of porous solids known as metal-organic frameworks in real time. 
The findings will aid the design of methods for making these useful compounds. 


KRISTA S. WALTON 


aterials called metal-organic frame- 
Mee (MOEFs) have sparked intense 

interest over the past few decades. 
In particular, those that form permanently 
porous architectures have tremendous poten- 
tial for applications such as chemical sensing, 
gas storage and catalysis. But techniques for 
synthesizing these compounds are still often 
developed through trial and error — in part 
because the mechanisms that dictate the 
self-assembly of MOF unit cells from their 
constituent metal ions and ligands, and their 
subsequent growth into nanoparticles, are 
largely unknown and difficult to observe. 
Writing in the Journal of the American Chemi- 
cal Society, Patterson et al.’ help to solve this 


Ligand 


ey 


Metal ion 


problem by reporting the first observations of 
the crystallization of MOFs made in real time, 
using a technique called liquid-cell transmis- 
sion electron microscopy. 

Previous studies of MOF crystallization 
have made use of various ex situ and in situ 
analytical techniques, including high- 
resolution transmission electron microscopy 
(HRTEM) and energy-dispersive X-ray dif- 
fraction (EDXRD). For example, HRTEM has 
been used to examine the crystallization of 
the well-characterized MOF-5, by analysing 
samples taken at various time intervals early 
in the compound’s synthesis’. Time-resolved 
in situ EDXRD has been used to determine the 
kinetics of MOF crystallization as a function of 
parameters that included pH, temperature and 
ligand length**. But the challenge of observing 


A 


Figure 1 | Growth of a metal-organic framework. Patterson et al.' have used liquid-cell transmission 
electron microscopy to study the growth of the ZIF-8 metal-organic framework (MOF) from its 
constituent metal ions and ligand molecules. They find evidence for a two-step process. a, First, the metal 
ions and ligands diffuse towards a nascent ZIF-8 crystal. b, Second, the ions and ligands move to an edge 
site, where they coordinate with each other, becoming part of the MOF lattice. This is the rate-limiting 


step of the process. (Figure adapted from ref. 1.) 
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the crystallization process in real time persists. 

Patterson and colleagues’ use of liquid-cell 
transmission electron microscopy (LCTEM) 
is a big step forward. This technique allows 
dynamic processes that occur in liquids to be 
imaged as they happen. It has been used to 
observe systems such as biological structures” 
and growing nanocrystals®, but had not previ- 
ously been applied to MOF syntheses. 

Because analytical samples can be damaged 
by the electron beam used in LCTEM, the 
authors began by performing a series of con- 
trol experiments using a zirconium-based 
MOF called UiO-66, to decouple the effects 
of beam irradiation on MOF synthesis from 
the effects of the reaction mechanism. UiO-66 
was a good choice for a control because it is 
easy to synthesize and extremely stable, which 
meant that it could be prepared ahead of the 
LCTEM experiments without any risk of it 
degrading before use. The authors observed 
that the dissolution or growth of UiO-66 par- 
ticles depends on the voltage of the electron 
beam. A threshold dosage of 40,000 electrons 
per square nanometre was also established — 
as long as experiments were performed below 
this limit, damage and particle motion during 
crystallization were negligible. 

Patterson et al. then chose another MOF, 
ZIEF-8, as the ideal candidate for demonstrat- 
ing the LCTEM method. The growth mecha- 
nisms of ZIF-8 have previously been studied’ 
using transmission electron microscopy on 
samples removed from synthetic solutions of 
the MOK, which provided a good comparison 
with the LCTEM results. ZIF-8 can be synthe- 
sized at room temperature in methanol, using 
zinc nitrate as the metal source and 2-methyl- 
imidazole molecules as the organic ligands. 
The authors observed the growth of ZIF-8 in 
real time over 11 minutes — the first particles 
detected were 15nm in diameter, with subse- 
quent growth observed up to 50nm. 

The authors went on to record videos of the 
particle formation and used them to determine 
the growth kinetics of the MOF, uncovering 
several important features of the crystalliza- 
tion process. First, they proved by direct obser- 
vation that ZIF-8 particles form through the 
growth of smaller subunits, rather than by 
particles coalescing. They also found that an 
excess of ligand molecules leads to the forma- 
tion of ZIF-8 particles that are smaller than 
those formed when the metal-to-ligand ratio 
is 1:1. The researchers had predicted this using 
ex situ methods, but LCTEM enabled them to 
observe the process as it occurred. 

A series of careful growth experiments was 
then performed under various accumulative 
electron doses. The results convincingly show 
that LCTEM can be applied effectively to study 
nanoparticles that are easily damaged by elec- 
tron beams. It remains to be seen whether the 
technique will be effective at the higher tem- 
peratures (typically greater than 100°C) at 
which most MOFs form. 
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A general conclusion from this work is that 
MOF growth occurs through the transport of 
metals and ligands to a nascent particle, fol- 
lowed by their movement to an edge or surface 
site, where bonding between the metal and 
ligand finally occurs (Fig. 1). The attachment 
of metal-ligand monomers to a surface site 
is therefore the controlling factor in particle 
growth, and the process is not diffusion-limited. 

The development of this ability to watch 
particle formation in situ during MOF self- 
assembly should enable a variety of compli- 
cated synthetic questions to be answered. One 
example is how the addition of ‘modulator’ 
compounds, which are sometimes used in 
MOF syntheses to control the crystallinity 
of the products, affects the growth kinetics. 
For MOFs that can adopt different structures, 
LCTEM could also shed light on what dictates 
whether kinetic products — those that crys- 
tallize most quickly — form during reactions, 
rather than the most thermodynamically stable 
products. LCTEM is a much-needed addition 


MATERIALS SCIENCE 


to the MOF-characterization toolkit, and its 
use in conjunction with other methods will 
no doubt lead to the specific control of crystal 
morphology, compositions and defects. = 
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Composite for energy 
storage takes the heat 


A polymer-based material has been discovered that breaks the rules — it has 
the right combination of properties for use in energy-storage devices called 
dielectric capacitors, and can function at high temperatures. SEE LETTER P.576 


HARRY J. PLOEHN 


evices known as dielectric capacitors 
D have a crucial role in applications that 
require short, intense power pulses or 
the conversion of direct current to alternating 
current. These applications include electronic 
systems for the integration of energy from 
renewable sources into power grids’, trans- 
port’ and military weapon systems’. They 
depend on electrically insulating materials 
known as dielectrics, which come in several 
types. Polymeric dielectrics offer advantages 
for large capacitors, but suffer from low operat- 
ing temperatures (usually well below 150°C) 
and low energy density (which means that 
devices that use polymeric dielectrics occupy 
large volumes). On page 576 of this issue, Li 
et al.* report that a composite of a polymer and 
nanometre-scale sheets of boron nitride pro- 
vides more than a 40% improvement in energy 
density compared with the best-available poly- 
mer dielectric, as well as remarkable stability at 
temperatures up to 300°C across a wide range 
of electric-field frequencies. 
Dielectric capacitors achieve the highest 
rate of energy transfer (termed the power or 
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rate capability) of all capacitor types. They 
store energy through a variety of molecular 
and nanoscale electron-polarization mecha- 
nisms”* that create oriented dipoles and 
associated dipolar electric fields. For high 
energy density, dielectric materials must 
have a high density of dipoles that have large 
induced dipole moments (which provide a 
measure of a charged system's polarity). A 
dielectric’s rate capability depends on how fast 
charges polarize and depolarize — how 
fast the dipoles reorient — as an applied elec- 
tric field varies. Invariably, not all of the energy 
stored in dipolar electric fields is recovered 
on depolarization; some is transferred into 
molecular translation and vibration (thermal 
energy) and is lost as heat, a process called 
dielectric loss. 

When and how polarized electrons begin to 
‘leak’ (conduct) through a dielectric depends 
on a property called the dielectric breakdown 
field strength (£,). Relatively small leakage 
currents may occur at field strengths below E,,. 
Once the field reaches E,, it promotes a cas- 
cade of electrons into the material's conduction 
band, resulting in catastrophic breakdown as 
the dielectric is transformed from an insulator 


into a conductor — which is bad 
news for dielectric capacitors. 
Leakage current also converts 
electrical energy into thermal 
energy. If this heat is not efficiently 
removed, the dielectric’s internal 
temperature rises, amplifying 
molecular motions and degrading 
the material’s mechanical proper- 
ties to the point that electrome- 
chanical stresses create another 
mechanism for dielectric break- 
down, shifting E, to lower values. 

The search for dielectrics that 
have high energy density, high rate 
capability and low conversion of 
electrical energy into heat has fol- 
lowed two distinct, yet intersect- 
ing paths. The first has led towards 
pure, homogeneous materials that 
can be synthesized and fabricated 
into large capacitors at minimal 
cost. The second has led to hetero- 
geneous, multiphase composites, 
which sacrifice some of the ease 
of manufacturing simplicity for 
an optimal compromise between 
properties and performance. 
Homogeneous materials include 
inorganics such as barium titan- 
ate (BTO) and organic polymers 
such as biaxially oriented polypro- 
pylene (BOPP, currently the best polymer die- 
lectric) and polyvinylidiene fluoride (PVDF). 

Polymers are the preferred choice for large 
capacitors because of the ease with which they 
can be processed and their defects controlled. 
However, the Moss rule, which originated 
in the semiconductor field”’, constrains the 
dielectric properties of homogeneous mater- 
ials: an increased polarizability is invariably 
accompanied by a decreased E,, which in turn 
lowers the maximum-attainable volumetric 
energy density, U. 

Heterogeneous materials might be able to 
get around the Moss rule. Most studies of such 
materials are variations on a theme: dispers- 
ing a suitable filler material (such as BTO) in 
a polymer may yield a composite with both 
high polarizability and high energy density, 
as well as adequate processability for fabrica- 
tion into large, reliable capacitors. But often, 
the addition of the filler dramatically reduces 
E,, eliminating any advantage. Dielectric losses 
and thermal management are also unresolved 
issues — even though these are key issues in 
the engineering of large capacitors. 

Liand colleagues’ work addresses a different 
set of issues that may enable polymers to bend 
the Moss rule, ifnot to break it. Scientists from 
the same research group previously reported’ 
that blends of boron nitride nanosheets 
(BNNS) with a PVDF-based polymer resulted 
in remarkable increases in E,, U, charge-dis- 
charge efficiency (the fraction of stored energy 
released on discharge), stiffness and thermal 


Figure 1 | A high-temperature dielectric material. Li et al.’ have made 

a composite material in which boron nitride nanosheets (BNNS) are 
suspended in a polymeric material, cross-linked divinyltetramethyldisiloxane- 
bis(benzocyclobutene). In this micrograph, the BNNS are visible as lighter 
flecks against the darker background of the polymer. The composite has better 
properties as a dielectric material for energy-storage applications than the 
best-available polymer dielectrics, and operates at higher temperatures. 

Scale bar, 5 micrometres. (Image from ref. 4.) 


conductivity compared with the pure poly- 
mer. These improvements were attributed to 
suppression of leakage currents by the BNNS. 
Those nanocomposites achieved Uvalues up 
to ten times that of BOPP, but still suffered 
from high dielectric losses associated with the 
host polymer. 

In the current work, Li et al. blended BNNS 
with a compound called divinyltetramethyl- 
disiloxane-bis(benzocyclobutene) (BCB), and 
then reacted the BCB molecules to produce 
nanocomposites of BNNS in polymeric cross- 
linked BCB (c-BCB,; Fig. 1). The dielectric prop- 
erties of these composites are remarkably stable 
over a wide range of temperatures (from room 
temperature to 300°C) and field frequencies 
(100 hertz to 1 MHz). The volumetric energy 
density and charge-discharge efficiency of 
c-BCB/BNNS composites at 150°C greatly 
exceed those of other polymers designed to 
work at high temperatures, and maintain mean- 
ingful values even at 300°C — more than 200°C 
higher than BOPP’s thermal limit for practical 
use. None of the other candidate high-temper- 
ature polymers studied by Li et al. approaches 
this level of dielectric performance at such high 
temperatures. 

Li and co-workers observed that BNNS 
suppress leakage currents in the nano- 
composite by about a factor of ten compared 
with pristine c-BCB, even at high tempera- 
tures. Remarkably, BNNS also increase the 
polymer’s E, value by 30-50%. More work 
should be carried out to better understand 
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the underlying mechanisms for 
suppression of current leakage 
and dielectric breakdown by 
BNNS, which form the basis 
by which c-BCB/BNNS appar- 
ently circumvents the Moss rule. 
Finally, the researchers report 
that BNNS increase the polymer’s 
thermal conductivity sixfold, 
and double its stiffness at tem- 
peratures above 150°C. These 
attributes will help c-BCB/BNNS 
composites to stay cool and retain 
electromechanical stability under 
continuous charge—discharge 
cycling, reducing or eliminating 
the need for auxiliary thermal- 
management systems. 

One drawback of the new 
composites is that BCB will be 
more expensive to produce than 
BOPP’s monomeric precursor, 
and blending BNNS into c-BCB 
represents an additional pro- 
cessing step compared with the 
manufacture of a pure polymer. 
It also remains to be seen what 
effects the BNNS will have on 
the number of defects and long- 
term reliability of the composites. 
Nonetheless, the relatively low 
filler loading of the nanocompos- 
ites, and the fact that irreversible cross-link- 
ing can be induced on heating or irradiation 
with ultraviolet light, will facilitate the devel- 
opment of ‘roll-to-rol? processes for produc- 
ing dielectric polymer films, which may help 
to control manufacturing costs for capacitors. 
If the advantages of the BNNS filler can be 
translated to other dielectric polymers that 
have higher polarizabilities than c-BCB, then 
even more remarkable advances in discharged 
energy density can be anticipated — as well as 
a reduction in the size of dielectric capacitors 
for electronics in power systems. = 
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Arresting developments 
in receptor signalling 


The first crystal structure of a G- protein-coupled receptor in complex with an 
arrestin protein provides insight into how the signalling pathways activated by 
these receptors are switched off through desensitization. SEE ARTICLE P.561 


JEFFREY L. BENOVIC 


-protein-coupled receptors (GPCRs) 
es an essential part in mediating 

signalling in the cells of many organ- 
isms. This process is primarily controlled by 
activation-dependent interactions of GPCRs 
with three protein families: heterotrimeric 
guanine-nucleotide-binding proteins (G pro- 
teins), GPCR kinases (GRKs) and arrestins. 
Until now, the only complete structure’ ofa 
GPCR complex was that of the B,-adrenergic 
receptor bound to the G protein Gs, solved 
in 2011. In this issue, Kang et al.” (page 561) 
present the second structure of a GPCR com- 
plex, in this case the GPCR rhodopsin bound 
to arrestin-1. 

The binding of aGPCR toa G protein results 
in the activation and subsequent regulation of 
downstream effector enzymes that modulate 
levels of ‘second messenger’ molecules such 
as cyclic AMP and calcium. By contrast, the 
interaction of aGPCR with a GRK promotes 
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phosphorylation of the receptor, which in 
turn facilitates arrestin binding. This turns off 
G-protein signalling, a process called desensi- 
tization, and promotes cellular internalization 
of the receptors and arrestin-mediated signal- 
ling. The dynamics of these protein-protein 
interactions are complex and incompletely 
understood. 

The best-characterized GPCR signalling 
pathway regulates the process of phototrans- 
duction in rod cells in the retina of the eye. 
This pathway involves rhodopsin, the G 
protein transducin and the effector enzyme 
cGMP phosphodiesterase (Fig. 1a). Photo- 
transduction is highly regulated by a GRK 
(GRK1) and an arrestin (arrestin-1); muta- 
tions in either of these proteins can lead to a 
visual defect called Oguchi disease. Rhodop- 
sin was the first GPCR to be crystallized in 
its basal state’ as well as in various activated 
conformations**. Although some structural 
insight into the binding of G proteins*® and 
arrestins’ to rhodopsin has been gained 


Figure 1 | The phototransduction pathway. a, The signalling pathway involving the G-protein-coupled 
receptor rhodopsin is initiated when light activation induces rhodopsin to form meta II rhodopsin, which 
interacts with the G protein transducin (Gt) to activate the effector enzyme cGMP phosphodiesterase. 
This pathway ultimately leads to a rapid visual response in rod cells. Meta II is then phosphorylated by 
the protein kinase GRK1 to yield P-meta II, which promotes interaction with the protein arrestin-1 to 
preclude further binding of Gt. P-meta II recycles by losing its bound chromophore (all-trans-retinal) 

to become P-opsin (not shown), which promotes arrestin-1 dissociation and dephosphorylation of the 
receptor. Opsin then binds the chromophore 11-cis-retinal to yield rhodopsin. Each of the individual 
proteins in this pathway (rhodopsin, meta II, opsin, Gt, GRK1 and arrestin-1) have been crystallized. 

b, Kang et al.’ provide the first crystal structure of a protein complex in this pathway, that of activated 
rhodopsin (blue) bound to arrestin-1 (red). Sections of arrestin-1 involved in the interface with 
rhodopsin are shown in yellow. The carboxy terminus of rhodopsin, which is phosphorylated by GRK1, is 
shown in purple. Rhodopsin is depicted in a phospholipid bilayer. 
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by using peptides co-crystallized with the 
receptor, Kang and colleagues’ structure of the 
full protein complex is a significant advance. 

Solving the X-ray structure of a rhodopsin- 
arrestin complex proved challenging and 
required a team of 72 investigators across 
25 institutions, who used various tricks to 
obtain diffractable crystals. First, the two 
proteins were mutated to aid the formation 
of active conformations. For rhodopsin, these 
mutations (E113Q and M257Y) yielded a con- 
formation that was constitutively active even 
in the absence of a bound chromophore (a 
colour-determining chemical group, such as 
all-trans-retinal, which typically activates rho- 
dopsin). For arrestin, the researchers mutated 
three adjacent amino-acid residues in a region 
that stabilizes the basal conformation and 
largely overcomes the need for the receptor to 
be phosphorylated to bind arrestin®. 

The authors were unable to generate a 
stable rhodopsin-arrestin binary complex 
(one formed through non-covalent bind- 
ing). Instead, they purified and crystallized a 
fusion protein in which arrestin was fused to 
the carboxy terminus of rhodopsin through 
a 15-residue linker. This fusion protein also 
included the enzyme T4 lysozyme at the 
amino terminus of rhodopsin — this facilitates 
crystallization without altering the structure 
of the complex. Finally, because the crystals 
were small and diffracted to 6-8 angstrém in 
synchrotron experiments, the authors used 
serial femtosecond X-ray laser crystallography, 
performed at the SLAC National Accelerator 
Laboratory in Menlo Park, California, to 
capture diffraction data for the crystals. 
This provided enough data to solve a struc- 
ture of the complex with resolution limits 
of 3.3-3.8 A. 

The structure reveals multiple points of 
contact between rhodopsin and arrestin, as 
well as structural changes in both proteins 
(Fig. 1b). The primary interface between the 
proteins involves the finger loop of arres- 
tin (which connects B-strands V and VI in 
arrestin and adopts an a-helical conforma- 
tion when bound), which interacts with three 
regions of rhodopsin: intracellular loop (ICL) 
1, the N-terminal region of helix 8, and the 
C-terminal region of transmembrane (TM) 7. 
Additional interactions include several arrestin 
loops: the middle and lariat loops bind to ICL2 
on rhodopsin, the back loop binds to TM5, and 
B-strand VI binds to TM5, TM6 and ICL3. 

The authors’ basic model proposes that 
rhodopsin uses multiple structural elements 
including TM7 and helix 8 to initially recruit 
arrestin, resulting in a rotation of approxi- 
mately 20° between the N and C domains of 
arrestin; this opens a cleft to accommodate 
rhodopsin’s ICL2. Indeed, a similar rota- 
tion between the N and C domains has been 
observed’ in a preactivated truncated form of 
arrestin-1 and in B-arrestin-1 (also known as 
arrestin-2) when bound to a phosphorylated 


receptor peptide’. Kang et al. extensively 
validated their rhodopsin-arrestin structural 
model by using double electron-electron 
resonance, hydrogen—deuterium exchange 
mass spectrometry, cell-based rhodopsin-— 
arrestin interaction assays, and site-specific 
disulfide cross-linking experiments. 

Kang and colleagues’ study provides insight 
into the interactions between GPCRs and 
arrestins, but much remains to be learned. We 
clearly need the structures of more GPCR com- 
plexes with G proteins, arrestins and GRKs, 
including complexes of such proteins with the 
same GPCR — for example, the B,-adrenergic 
receptor in complex, separately, with Gs, GRK2 
and f-arrestin-1, or rhodopsin in complex 


GENETICS OF DISEASE 


with transducin and GRK1 to complement the 
arrestin-1 structure. Such studies will reveal 
whether these three classes of protein have 
specific preferences for particular receptor 
conformations, and should facilitate the devel- 
opment of compounds that might serve as 
selective modulators of specific GPCR signal- 
ling pathways. This would be an important step 
in helping to treat the many diseases that are 
mediated by GPCR signalling pathways, which 
are currently the target for approximately 40% 
of the pharmaceuticals on the market. m 
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Associations 
with depression 


Two genetic regions associated with major depressive disorder have been 
revealed for the first time, through whole-genome sequencing of a population 


of Han Chinese women. SEE LETTER P.588 


PATRICK F. SULLIVAN 


f all complex human illnesses, major 

depressive disorder (MDD) has argu- 

ably proved the trickiest to under- 
stand. Despite decades of research, there is 
little certainty about its biological basis, in 
part because genetic clues to its aetiology 
have been hard to find’. The combination of 
relatively high prevalence and relatively low 
heritability seems to indicate that MDD does 
not lend itself to genetic analysis, although the 
genetic dissection of type 2 diabetes mellitus, 
which has a similar prevalence and heritability, 
has been much more productive. On page 588 
of this issue, the CONVERGE consortium? 
identifies the first two long-awaited genetic 
associations for MDD. 

Our ignorance about MDD is in marked 
contrast to its impact on people and public 
health’. The disease is common, costly and 
associated with high rates of morbidity and 
mortality. As such, it stands to reason that 
this research is exciting for those who study 
MDD. But it also exemplifies a sometimes 
neglected issue — how an informed approach 
to improving the definition of a complex illness 
can lead to success where other approaches 
have failed. 

Why is defining MDD so complicated? 
Sadness is normal and integral to the human 
condition. However, much too frequently, sad- 
ness becomes pervasive, persistent, unshak- 
able and associated with signs and symptoms 


characteristic of MDD, such as changes in 
sleeping habits, appetite and cognition, and the 
onset of suicidal tendencies. But where should 
we draw the line between normality and 
pathology? This question is echoed through- 
out medicine, for example when using normal 
fasting blood glucose levels to delineate nor- 
mal physiology from that of type 2 diabetes, or 
when separating normal blood pressure from 
hypertension. The difference is that the meas- 
ures for these latter two conditions are more 
objective than those 


Aninformed for assessing ‘sadness, 
approach to and the consequences 
improving of each disease more 
the definition readily assessed. 
of a complex There is no laboratory 
illness can test that will help us to 
lead to success know when sadness 
where other becomes MDD. 

approaches The CONVERGE 
have failed. consortium authors 


reasoned that the core 
issue hampering the discovery of MDD-asso- 
ciated genes is heterogeneity — in a group of 
people who all have the same MDD symptoms, 
the aetiology of the MDD may in fact be differ- 
ent. Some people might have a highly genetic 
form of the disease, whereas in others, MDD 
may be brought on by environmental factors 
such as poverty, physical or sexual abuse, or 
an unhealthy lifestyle. Still others may have 
a primary problem such as alcoholism, of 
which MDD isa secondary consequence. This 
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long-held concept of heterogeneity has made 
defining ‘true’ MDD something of a holy grail. 
The researchers made a set of intelligent 
decisions when defining who to study. They 
reasoned that more-severe cases would have a 
clearer and less-complex genetic signal — an 
approach widely used in human genetics to 
minimize heterogeneity in complex illnesses. 
Unlike more-inclusive approaches’, the 
consortium authors implemented several 
measures that they thought would maximize 
their chances of success. They worked in 
China, where the prevalence of MDD is lower 
than that in the United States or Europe, 
studied only women and selected relatively 
severe cases, in which the women had experi- 
enced two or more episodes of MDD, using 
psychiatric inpatient and outpatient facili- 
ties. Unusually, they genotyped their samples 
using low-coverage whole-genome sequencing 
(genotyping determines the identity of genetic 
variations across the genome). They identified 
two regions in which genetic changes, or vari- 
ants, are associated with MDD — one near the 
SIRT1 gene and the other in an intron (a non- 
protein-coding region) of the gene LHPP. 
The typical approach to genotyping in 
human genetics is to use an array contain- 
ing a fixed set of between 500,000 and 
1 million genetic markers — DNA variants at 
known chromosomal locations. To my know- 
ledge, this is the only published study in which 
genotyping involved low-coverage sequencing 
of the whole genome. Because of decreases in 
the costs of genotyping arrays, it may be one of 
the last. The authors’ low-coverage sequencing 
had relatively high error rates — around 2% of 
the genetic variants that they identified could 
not be replicated with a different method, com- 
pared with less than 0.5% for an inexpensive 
array. Moreover, despite their wish to gain 
traction on MDD-causing genetic associations 
that have yet to be described in China, the two 
variants that they found have been in standard 
databases for a decade. This is because genetic 
variants such as these, which are common in 
China, are evolutionarily old, and so likely to 
be found across the globe. Wisely, the authors 
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confirmed the variants with a second method 
and replicated the results in an independ- 
ent sample, ensuring that their associations 
meet typical standards for significance and 
replication. 

The consortium suggests that the proximity 
of one of the variants to SIRT1 implicates 
abnormalities in mitochondria (the cell’s 
energy-producing centres) in MDD, because 
one role of the SIRT1 protein is to regulate 
mitochondrial function. If this assertion holds 
up, it is likely to imply that many other genetic 
variants are involved in altered mitochondrial 
function and have associations with MDD that 
near the threshold for significance. A standard 
way to evaluate such a hypothesis is to investi- 
gate whether a specific genetic pathway, such as 
that involving the genes that affect mitochon- 
drial function, is statistically more likely to have 
smaller P-values for association with MDD 
than expected by chance. This analysis was 
not reported in the current study. A previous 
systematic pathway analysis of MDD and other 
major psychiatric disorders did not implicate 
mitochondrial biology’. As such, although the 
researchers’ hypothesis is intriguing, it requires 
replication, extension, integrated analysis and 
more biological evidence. 

I wish the authors had formally tested their 
fundamental premise, namely that their sam- 
ple was more homogeneous than those studied 
previously. If that is the case, then the herit- 
ability of the common variants (the propor- 
tion of variance contributing to MDD that 
can be accounted for by the genetic variation 
they measured) should be notably high. But 
although several methods exist to check this®”, 
the authors did not report such an analysis. 

More unsettling, the two variants identified 
have almost no association signal in samples 
taken from European populations®. The rea- 
sons for this discrepancy are unclear. Perhaps 
these variants are truly causative for MDD only 
in severe cases from China. However, many 
other common associations for complex ill- 
nesses hold across the world. The authors 
tested the comparability of their findings with 
European samples and found some evidence 
of overlap, but more-refined analyses would 
be of keen interest. 

This first identification of replicable, 
significant genome-wide associations for 
MDD is exceptional. Although further work 
is required, it is to be hoped that these results 
will provide therapeutic targets for MDD. The 
drug-discovery pipeline for MDD has never 
been based on solid biological foundations, but 
the work begun here could improve the focus 
of the field. 

The authors’ study marks the beginning 
of the beginning for the genetic dissection 
of MDD. The CONVERGE consortium 
has provided an excellent starting point 
for what should be an intriguing voyage 
of discovery. m See go.nature.com/Isghoc for a related 
News story. 
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Cataracts dissolved 


Mutations underlying hereditary cataracts in two families impair the function of 
an enzyme that synthesizes the lens molecule lanosterol. The finding may lead to 
non-surgical prevention and treatment of cataracts. SEE LETTER P.607 


J. FIELDING HEJTMANCIK 


mutation in the gene encoding the enzyme 
lanosterol synthase (LSS) as the cause of 
inherited cataracts in two families. LSS, which 
is produced in the lens, synthesizes lanosterol, 
a molecule that is amphipathic (that is, it has 
both hydrophilic and hydrophobic properties). 
The authors show that lanosterol can dissolve 
the precipitates, and even the amyloid-like fibril 
structures, of mutant lens crystallin proteins 
that are the cause of cataracts in some individu- 
als. Furthermore, lanosterol effectively treated 
naturally occurring cataracts in rabbit lenses 
and in dogs in vivo. In addition to elucida- 
ting the visual process, this work promises to 
continue in the tradition of lens research by 
expanding scientific insight into broad and 
often seemingly unrelated areas of enquiry. 
The eye lens has been intensively studied for 
almost two centuries. In 1833, optics scientist 
David Brewster deduced the fine structure 
of the cod lens, calculating that it contained 
5 million fibre cells, each 4.8 millimetres long, 
using only a candle and a finely ruled steel 
bar’. In 1901, embryologist Hans Spemann’s 
study of lens development resulted in the con- 
cept of inductive cellular interactions during 
embryonic development’. Studies of lens bio- 
chemistry began in the late nineteenth century 
with descriptions of the high concentrations of 
heterogeneous structural proteins now known 
as crystallins*. Subsequently, one of the first 
genetic locations on a non-sex chromosome 
to be associated with disease was linked to 
cataract susceptibility’, and messenger RNA 
molecules encoding chicken lens 5-crystallins 
were among the first mRNAs to be isolated, 
cloned and studied’. 
The function of the eye lens is to 


[: this issue, Zhao et al.’ (page 607) identify a 
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transmit light and focus it on the retina. The lens 
accomplishes this through a single cell type that 
follows a developmental pattern, beginning as 
a member of the germinative zone in the sin- 
gle layer of anterior epithelial cells overlaying 
a mass of fibre cells. The epithelial cells then 
migrate laterally towards the lens equator, where 
they elongate and invert to form secondary fibre 
cells, arranged in a curved, onion-like configu- 
ration. As they do this, the cells synthesize 
large amounts of crystallins, such that they 
contain perhaps the highest concentration of 
proteins found in any tissue. They also degrade 
organelles, minimize extracellular space and 
increase the density of their cell membranes to 
levels approaching that of the cell’s cytoplasm, 
all of which decrease light scattering’. Thus, 
transparency is accomplished largely through a 
combination of the microarchitecture of the lens 
and, on a molecular level, the densely packed 
lens crystallins (Fig. 1). 

Human crystallins are divided into two 
families, a- and By-crystallins; together, these 
make up 90% of the water-soluble proteins in 
lens cells*. They are extremely stable, highly 
ordered and provide a relatively constant 
refractive index, which allows lens transpar- 
ency’. Because differentiated lens fibre cells 
lack the synthetic apparatus to produce new 
proteins, crystallins are not turned over, and 
those in the centre of the lens are among 
the oldest proteins in the body. Preserving 
crystallin structure and function is therefore 
crucial for prevention of lens opacities. Other 
biological activities of the lens serve primar- 
ily to protect the complementary systems of 
crystallin packing and fibre-cell arrange- 
ment from disruption and damage by age and 
external insults, especially ultraviolet light, 
oxidative stress and glycation. 

The genes that cause cataracts when mutated 


PAUL FITZGERALD/REF. 16 


tend to encode proteins that are 
involved in one of these biological 
pathways or functional groupings 
of proteins that are critical for lens 
homeostasis. In families at risk of con- 
genital cataracts for which the mutant 
gene is known, slightly less than half 
have mutations in lens crystallins, with 
others having mutations in growth 
or transcription factors, membrane 
proteins, chaperone proteins and pro- 
teins involved in protein degradation, 
among others’”. Zhao and colleagues’ 
identification of LSS mutations as a 
cause of congenital cataracts adds a 
new pathway. 

The catastrophic structural changes 
in crystallins seen in many hereditary 
cataracts can overwhelm the defen- 
sive systems of the lens, and might 
also be refractory to the solubiliz- 
ing activity of lanosterol identified 
by the authors. Nevertheless, this 
agent might be more therapeuti- 
cally applicable to the slow pro- 
gressive denaturation of crystallins seen in 
age-related cataracts. In age-related cataracts, 
damaged By-crystallin proteins are bound by 
a-crystallins, which act like chaperones — 
proteins that assist the folding or unfolding 
of other proteins — except that, instead 
of refolding the denatured By-crystallins, 
a-crystallins solubilize them", thereby 
reducing light scattering. However, as more 
crystallins are damaged and bound over time, 
the protein complexes themselves become 
large enough to scatter light’’””. Eventually, 
the complexes precipitate, forming the insol- 
uble protein fraction (termed high molecular 


STRONG-FIELD PHYSICS 


Figure 1 |The eyelens. This cross-section of a mouse eye 
lens shows the curved, onion-like configuration of fibre cells, 
which are packed close together and lose subcellular structures 
such as nuclei (coloured blue) as they mature and move 

to the centre of the lens. Fibre cells contain highly ordered 
crystallin proteins, the intracellular concentration of which 
increases towards the interior of the lens (seen as darkening 
pink). This combined cellular and intracellular structure 
gives transparency to the lens. Denaturation and aggregation 
of crystallin proteins can result in the lens opacity known as 
acataract. Zhao et al.' show that the molecule lanosterol can 
redissolve crystallin aggregates and alleviate cataracts. 


weight aggregates) that increases with normal 
ageing and especially in cataractous lenses. 
This identifies cataracts, in at least some cases, 
as a protein-misfolding disease”’. 

Although surgery to remove cataracts is effi- 
cacious and safe, ageing populations around 
the world are predicted to require a doubling 
of cataract surgery in the next 20 years"*. The 
same population demographics suggest that, if 
development of age-related cataracts in suscep- 
tible individuals could be delayed by even ten 
years, the need for surgery could be reduced 
by almost half'*. Pre-symptomatic screen- 
ing of age-related cataracts is easy, and the 


Harmonic radiation 
from crystals 


Electrons in a crystal can tunnel between energy bands when a strong electric field 
is switched on. It emerges that electron pathways interfere almost instantaneously, 
giving rise to ultra-short, pulsed emission of light. SEE LETTER P.572 


PETER HOMMELHOFF & TAKUYA HIGUCHI 


r Vhe puzzling but experimentally verified 
fact that particles can propagate 
through walls is a hallmark of quantum 

mechanics. In crystalline solids, the motion 

of electrons is restricted by the presence of 
the atomic lattice, which limits their energy 
to certain ranges known as energy bands. 

Because an electron’s energy cannot exceed 

these limits, gaps are formed between bands. 


In the heyday of quantum theory, the physicist 
Clarence Zener showed’ that the electrons ina 
solid that is subjected to a strong electric field 
can tunnel between energy bands, traversing 
the classically imposed barrier. On page 572 
of this issue, Hohenleutner et al.” report an 
experimental and theoretical study showing 
that, when a strong electric field oscillating 
at terahertz frequencies (1 THz is 10'* Hz) 
is applied to a crystal, various bands can be 
coupled together by electron tunnelling, and 
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eye is easily accessible for topical application 
of drugs. Zhao and colleagues show that eye 
drops containing lanosterol successfully 
treated natural cataracts in dogs. The poten- 
tial for this finding to be translated into the 
first practical pharmacological prevention, or 
even treatment, of human cataracts could not 
come at a more opportune time. Furthermore, 
this approach might serve as a model for other 
protein-misfolding diseases affecting a variety 
of tissues and organ systems. = 
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the crystal emits ultra-short bursts of high- 
harmonic radiation. 

The frequency of this oscillating driving 
field, which corresponds to mid-infrared 
wavelengths, is much lower than any of the fre- 
quencies required for straightforward electron 
jumps between different energy bands. Because 
of its large strength, however (greater than 
1 volt per nanometre), the field drives electrons 
to tunnel from one band to another** on femto- 
second timescales (1 fs is 10°!’ seconds), that is, 
almost instantaneously with the switching-on 
of the field. The electrons’ dynamics are com- 
plex and include not only tunnelling to differ- 
ent energy bands, but also acceleration within 
each band; these processes result in the radia- 
tion of electromagnetic waves at a much higher 
frequency than that of the driving terahertz- 
frequency field. The emitted radiation is called 
high-harmonic radiation because its spectrum 
usually displays peaks at harmonics (integer 
multiples) of the driving field’s frequency, 
reflecting the field’s temporal periodicity. 

The observed high-harmonic radiation 
consists of pulses of ultra-broadband visible 
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Figure 1 | Electron tunnelling and interference. The electrons ina solid form energy bands (here, 1-3) 
and can tunnel between these when a strong, terahertz-frequency field is applied. Band 1 can be reached 
directly from band 2 (solid arrow), but also through the path 2 to 3 to 1 (dotted arrows). The sign (+ or -) 
of the electrons’ wavefunctions (®; solid and dotted blue lines) after each tunnelling event is determined by 
the polarity of the applied field (one cycle, shown in green). Hohenleutner et al.” observe that the electrons’ 
wavefunction does not change sign after each tunnelling event for positive field polarity (left), but does 
change sign for negative polarity (right). As a result, for positive polarity the wavefunctions interfere 
constructively (red line in band 1, left), whereas for negative polarity they interfere destructively (red line 
in band 1, right). Consequently, for positive polarities of the applied field only, the crystal emits a pulse of 
radiation (not shown) at frequencies corresponding to high harmonics of the applied field’s frequency. 


and infrared light that are only several femto- 
seconds long. These pulses reveal the signature 
of the electronic states populated by the tunnel- 
ling process and allow accurate tracing of the 
dynamics of the crystal’s electrons. The authors 
record the high-harmonic radiation that is emit- 
ted from a gallium selenide crystal subjected to 
pulses centred at a frequency of about 30 THz. 
Their experiment measures the structure of the 
high-harmonic spectrum with femtosecond 
precision, providing greater insight into the 
electron dynamics than would be possible by 
measurements of the spectrum without ultra- 
fast temporal resolution. Using sophisticated 
optical techniques, the authors are able to pin- 
point the moment at which the high harmon- 
ics are generated during a terahertz-frequency 
pulse that is only a few cycles long. 

These results would have been hard to inter- 
pret without the in-depth theoretical under- 
standing and modelling that Hohenleutner 
and colleagues use to complement their exper- 
iment. The data can be explained by invoking 
inter-band tunnelling, but the authors show 
that more than two electron energy bands are 
involved — five are required. Moreover, quan- 
tum interference between the various tunnel- 
ling paths needs to be invoked for a proper 
explanation of the observations. In classic 
interference, waves combine constructively 
or destructively depending on whether they 
arrive at a particular spot in or out of phase, 
which hinges on the difference between the 
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distances the waves have covered. Analogously, 
in this experiment, different electron-excita- 
tion pathways give rise to quantum interfer- 
ence and couple the five energy bands together. 

Using numerical modelling, the authors arti- 
ficially ‘switched off’ the interference between 
the tunnelling pathways, demonstrating that 
the quantum interference is essential to fit the 
experimental data. The modelling also shows 
that the polarity of the driving electric field 
determines whether the interference is con- 
structive or destructive (Fig. 1). It turns out 
that high-harmonic radiation is emitted only 
when the driving electric field is at its peak, 
and for one polarity of the field only. When 
this happens, the intensity of the emitted high- 
harmonic pulse is enhanced by a factor of 30, 
owing to constructive interference between the 
tunnelling pathways, compared with the case 
without interference. 

High-harmonic emission in solids has been 
a vibrant area of research since it was shown 
that the excitation of crystals by a strong, long- 
wavelength electromagnetic field generates 
radiation at high-harmonic frequencies of the 
driving field’s fundamental frequency”*. The 
strong-field physics of individual atoms in the 
gas phase is well understood, but the different 
scaling of the maximum harmonic frequency 
with the strength of the incident field in solids 
had hinted at different underlying processes’. 
However, a recent comparative study’ found 
that some aspects of the gas-physics picture 
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apply to solids too, such as the importance of 
controlling the electrons’ pathways so as to 
generate high-harmonic radiation. In another 
study’, researchers applied optical pulses of 
sub-period duration to a silicon dioxide crystal 
and showed that the observed high-harmonic 
radiation reaches frequencies of 8,500 THz 
(corresponding to the extreme- ultraviolet 
domain), a record value for intra-band cur- 
rents induced ina solid. 

From insights into such observations, new 
ways may be devised to control the phase of the 
electron wavefunction in crystals, for instance 
through the instantaneous modification of 
the band structure by external electric fields. 
Related work with solids subjected to strong 
fields has also shown that the material may 
be reversibly changed from being a dielectric 
(insulator) to a semi-metal (conductor) ina 
femtosecond”. 

These studies, and Hohenleutner and 
colleagues’ work in particular, open the door to 
using electrons in solids as a quantum-physics 
playground. For example, fully understanding 
the radiation mechanism will allow us to infer 
the electron wavefunctions from the emitted 
high harmonics. This might enable the tomo- 
graphic reconstruction of a crystal’s electronic 
band structure. Moreover, the ultra-short time- 
scales observed in this work could open up new 
ways of quantum information processing, in 
which information will be encoded in the elec- 
trons’ wavefunctions. Other applications can 
also be envisaged, including the generation of 
intense sources of coherent extreme-ultraviolet 
radiation. Because of the complex nature of any 
condensed material, further strong-field studies 
of crystals, disordered solids, and even liquids 
might lead to other surprise discoveries. m 
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WOWNAORWHE 


CORRECTION 

The News & Views article ‘Astrochemistry: 
Fullerene solves an interstellar puzzle’ by 
Pascale Ehrenfreund and Bernard Foing 
(Nature 523, 296-297; 2015) omitted the 
relevant reference citations and full credit 
information for Figure 1. This has now been 
corrected online. 


ARTICLE 


doi:10.1038/nature14565 


Timing and climate forcing of volcanic 
eruptions for the past 2,500 years 


M. Sigl't, M. Winstrup’, J. R. McConnell!, K. C. Welten?, G. Plunkett*, F. Ludlow’, U. Biintgen®”’, M. Caffee?!°, N. Chellman’, 
D. Dahl-Jensen", H. Fischer”’, S. Kipfstuhl'’, C. Kostick'*, O. J. Maselli!, F. Mekhaldi!>, R. Mulvaney’®, R. Muscheler’®, 
DLR. Pasteris!, J.R. Pilcher*, M. Salzer", S. Schtipbach’””, J. P. Steffensen", B. M. Vinther! & T. E. Woodruff? 


Volcanic eruptions contribute to climate variability, but quantifying these contributions has been limited by inconsis- 
tencies in the timing of atmospheric volcanic aerosol loading determined from ice cores and subsequent cooling from 
climate proxies such as tree rings. Here we resolve these inconsistencies and show that large eruptions in the tropics and 
high latitudes were primary drivers of interannual-to-decadal temperature variability in the Northern Hemisphere 
during the past 2,500 years. Our results are based on new records of atmospheric aerosol loading developed from 
high-resolution, multi-parameter measurements from an array of Greenland and Antarctic ice cores as well as 
distinctive age markers to constrain chronologies. Overall, cooling was proportional to the magnitude of volcanic 
forcing and persisted for up to ten years after some of the largest eruptive episodes. Our revised timescale more 
firmly implicates volcanic eruptions as catalysts in the major sixth-century pandemics, famines, and socioeconomic 
disruptions in Eurasia and Mesoamerica while allowing multi- millennium quantification of climate response to volcanic 


forcing. 


Volcanic eruptions are primary drivers of natural climate variabil- 
ity—their sulfate aerosol injections into the stratosphere shield the 
Earth’s surface from incoming solar radiation, leading to short-term 
cooling at regional-to-global scales'. Temperatures during the past 
2,000 years have been reconstructed at regional’, continental*, and 
global scales* using proxy information from natural archives. Tree- 
ring-based proxies provide the vast majority of climate records from 
mid- to high-latitude regions of (predominantly) the Northern 
Hemisphere, whereas ice-core records (for example, 5'*O) represent 
both polar regions’. 

Climate forcing reconstructions for the Common Era (cE)— 
including solar (for example, '°Be)° and volcanic (for example, sulfate)*” 
activity—derive mostly from ice-core proxies. Any attempt to attrib- 
ute reconstructed climate variability to external volcanic forcing, and 
to distinguish between response, feedback, and internal variability of 
the climate system, requires ice-core chronologies that are synchron- 
ous with those of other climate reconstructions. In addition, multi- 
proxy climate reconstructions** derived from ice cores and other 
proxies such as tree rings will have diminished high- to mid-frequency 
amplitudes if the individual climate records are on different time- 
scales. 

The magnitudes and relative timing of simulated Northern 
Hemisphere temperature responses to large volcanic eruptions are 
in disagreement with reconstructed temperatures obtained from tree 
rings*’, but it is unclear to what extent this model/data mismatch 
is caused by limitations in temperature reconstructions, volcanic 
reconstructions, or implementation of aerosol forcing in climate 
models”"''. The hypothesis of chronological errors in tree-ring-based 


temperature reconstructions*” offered to explain this model/data mis- 
match has been tested and widely rejected’*"*, while new ice-core 
records have become available providing different eruption ages’*"® 
and more precise estimates of atmospheric aerosol mass loading’” 
than for previous volcanic reconstructions. 

Using documented” and previous ice-core-based eruption ages"®, 
strong summer cooling following large volcanic eruptions has been 
recorded in tree-ring-based temperature reconstructions during the 
second millennium ce with a one-to-two year lag similar to that 
observed in instrumental records after the 1991 Pinatubo eruption”. 
An apparent seven-year delayed cooling observed in individual tree- 
ring series relative to Greenland ice-core acidity peaks during the first 
millennium cE, however, suggests a bias in existing ice-core chronolo- 
gies?'. Using published ice-core chronologies, we also observed a 
seven-year offset between sulfate deposition in North Greenland and 
the start of tree-ring growth reduction in a composite of five multi- 
centennial tree-ring summer temperature reconstructions (‘N-Tree’) 
from the Northern Hemisphere between 1 and 1000 cE (Methods), 
whereas tree-ring response was effectively immediate for eruptions 
occurring after 1250 cE (Fig. 1a). 


Precise time marker across hemispheres 


Independent age markers with which to test the accuracy of tree-ring 
and ice-core chronologies have recently become available with the 
detection of abrupt enrichment events in the '*C content of tree rings. 
Rapid increases of atmospheric '*C were first identified in individual 
growth increments of cedars from Japan between 774 CE and 775 CE” 
and between 993 cE and 994 cz”. The presence and timing of 
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Figure 1 | Annual }°Be ice-core records and post-volcanic cooling from 
tree rings under existing ice-core chronologies. a, Superposed epoch analysis 
for the largest volcanic signals in NEEM-2011-S1 between 78 and 1000 cE 

(n = 7; orange trace) and for the largest eruptions between 1250 and 2000 cE 
(n = 10; grey trace)'®. Shown are standardized growth anomalies (z scores 
relative to 1000-1099 cz) from a multi-centennial, temperature-sensitive 
tree-ring composite (N-Tree*”**’*7*, Methods) ten years after the year of 
volcanic sulfate deposition at the NEEM ice core site in Greenland (GICC05 
timescale), relative to the level five years before sulfate deposition. b, Annually 
resolved !°Be concentration records from the WDC, TUNU2013, NGRIP, and 
NEEM-2011-S1 ice cores on their original timescales and annually resolved 
AC series from tree-ring records between 755 cE and 795 cE”, with green 
arrows representing the suggested time shifts for synchronization; error bars 
are lo measurement uncertainties; the estimated relative age uncertainty 

for TUNU2013 at this depth interval from volcanic synchronization with 
NEEM-2011-S1 is +1 year. c, Annually resolved '°Be concentration record 
from NEEM-2011-S1 ice core on its original timescale and annually resolved 
A'C series from tree rings in 980 cE and 1010 ce”; error bars are 1o 
measurement uncertainties. 


the event in 775 cE (henceforth, the 775 event) has been reproduced 
by all radiocarbon measurements performed on tree rings at annual 
(or higher) resolution—including tree cores from Germany”, the 
Alps’, the Great Basin” (USA), and Siberia’. Recent identification 
of the same 775 CE event in kauri wood samples from New Zealand in 
the Southern Hemisphere demonstrates the global extent of the rapid 
"SC increase and provides further constraints on the event’s exact 
timing (March 775+6 months) owing to the asynchronous 
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Southern Hemisphere growing season’. While the cause of the 775 
and 994 events is still debated**”*”’, we expect that they might also 
have produced an excess of cosmogenic ‘°Be through the interaction 
of high-energy particles with atmospheric constituents*”’. Since both 
of these radionuclides are incorporated rapidly into proxy archives via 
aerosol deposition (‘°Be in ice cores) and photosynthesis (*CO, in 
tree rings), isotopic anomalies caused by these extraterrestrial events 
provide a global age marker with which to link ice-core records to 
tree-ring chronologies directly”’. The latter serve as an absolute and 
precise age marker, verified (at least since 775 CE) by the coherence 
of the rapid increase in ““C in all tree-ring records for which high- 
resolution radiocarbon analyses were performed, including those 
speculated to be at risk of missing rings®. 

We measured 'Be concentrations at approximately annual reso- 
lution in four ice cores—NEEM-2011-S1, TUNU2013, and NGRIP 
in Greenland, and the West Antarctic Ice Sheet Divide Core 
(WDC)—over depth ranges encompassing the year 775 CE as dated 
in existing ice-core chronologies in order to provide a direct, phys- 
ically based test of any dating bias in these chronologies (Fig. 1, 
Extended Data Fig. 1, Methods, Supplementary Data 1). Both polar 
ice sheets contain '°Be concentrations exceeding the natural back- 
ground concentration (>150%; 60) for one-to-two consecutive 
years, compatible with the 775 CE event observed in tree rings. 
Using the original ice-core age models!**°, the ages of the ‘°Be max- 
ima in NEEM-2011-S1, NGRIP, and WDC are 768 CE, offset by 7 
years from the tree-ring event. A further ‘°Be anomaly measured in 
NEEM-2011-S1 at 987 ce, compatible with the 994 cE event in tree 
rings, suggests that a chronological offset was present by the end of 
the first millennium ce (Fig. 1). Several different causes may have 
contributed to the offset (see a summary in the Methods section), 
among which is the use of a previous dating constraint* for 
Greenland, where volcanic fallout in the ice was believed to indicate 
the historic (79 cE) eruption of Vesuvius. 


Revised ice-core chronologies 


Given the detection of a bias in existing ice-core chronologies, we 
developed new timescales before the 1257 Samalas eruption in 
Indonesia*’ using highly resolved, multi-parameter aerosol concen- 
tration records from three ice cores: NEEM-2011-S1, NEEM, and 
WDC. We used the StratiCounter program, an automated, objective, 
annual-layer detection method based on Hidden Markov Model 
(HMM) algorithms” (Methods). For NEEM-2011-S1, the confidence 
intervals obtained for the layer counts allowed us to improve the 
dating further by constraining the timescale using the 775 cE '’Be 
anomaly and three precisely dated observations of post-volcanic aero- 
sol loading of the atmosphere (Fig. 2, Extended Data Tables 1-3, 
Methods, Supplementary Data $2). 

We evaluated the accuracy of our new chronologies ((WD2014’ 
for WDC and ‘NS1-2011’ for NEEM) by comparison to (1) an 
extensive database of historical volcanic dust veil observations 
(Extended Data Fig. 2, Methods, Supplementary Data 2), (2) ice-core 
tephra evidence (Methods), and (3) the 994 cE event (Methods, Fig. 2). 
Using the new timescales, we found large sulfate signals in Greenland 
(for example, in 682 CE, 574 cE, and 540 cE) between 500 cE and 2000 cE 
that frequently occurred within one year of comparable—and indepen- 
dently dated—signals in Antarctica. These bipolar signals can now be 
confidently attributed to large tropical eruptions (Fig. 2). Back to 400 
BCE, Other large sulfate peaks (for example, 44 BCE) were synchronous to 
within three years (Fig. 2). We conclude that the revised ice-core time- 
scales are accurate to within less than five years during the past 2,500 
years, on the basis of combined evidence from radionuclides, tree rings, 
tephra analyses, and historical accounts. Compared to the previous 
chronologies, age models differ before 1250 ce by up to 11 years 
(GICC05, Greenland) and 14 years (WDC06A-7, Antarctica) 
(Extended Data Fig. 3). 
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Figure 3 | Global volcanic aerosol forcing and Northern Hemisphere 
temperature variations for the past 2,500 years. a, 2,500-year record of tree- 
growth anomalies (N-Tree***’*’*; relative to 1000-1099 ce) and reconstructed 
summer temperature anomalies for Europe and the Arctic’ with the 40 
coldest single years and the 12 coldest decades based on N-Tree indicated. 
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b, Reconstructed global volcanic aerosol forcing from bipolar sulfate composite | Tambora 1815 eruption. 
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records from tropical (bipolar), Northern Hemisphere, and Southern 
Hemisphere eruptions. Total (that is, time-integrated) forcing values are 
calculated by summing the annual values for the duration of volcanic sulfur 
deposition. The 40 largest volcanic signals are indicated, and ages are given 
for events representing atmospheric sulfate loading exceeding that of the 
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History of volcanic forcing 
Employing our revised timescales and new high-resolution, ice-core 
sulfur measurements, we developed an extended reconstruction of 
volcanic aerosol deposition since early Roman times for both polar 
ice sheets, from which we then estimated radiative forcing using 
established transfer functions’® (Fig. 3, Methods, Supplementary 
Data 3-5). This forcing series is characterized by improved dating 
accuracy, annual resolution, and a larger number of ice-core records 
in the Antarctic ice-core sulfate composite’’ than in previous recon- 
structions®’. It spans 2,500 years, allowing investigation of climate- 
volcano linkages more accurately and earlier than with previous 
reconstructions. It also provides a perspective on volcanic influences 
during major historical epochs, such as the growth of Roman imperial 
power and subsequent decline during the “Migration Period’ (the early 
part of the first millennium cE) in Europe—times of (1) demographic 
and economic expansion as well as relative societal stability and 
(2) political turmoil and population instability, respectively’. With 
improved dating and lower volcanic-sulfate detection limits from 
the Antarctic array’’, we were able to detect, estimate, and attribute 
volcanic aerosol loading and forcing from 283 individual eruptive 
events during this period (Fig. 3). 

We attributed about half of these to mid- to high-latitude Northern 
Hemisphere sources, while 81 were attributed to tropical eruptions 
(having synchronous sulfate deposition on both polar ice sheets). 


These tropical volcanic eruptions contributed 64% of total volcanic 
forcing throughout the period, with five events exceeding the sulfate 
loading from the 1815 Tambora eruption in Indonesia (Fig. 3, 
Extended Data Table 4). Events in 426 BcE and 44 BCE rival the great 
1257 cE Samalas eruption as the largest sulfate-producing eruptions 
during this time. These two earlier events have not been widely 
regarded as large tropical eruptions with potential for strong climate 
impact’, owing to the lack of complete and synchronized sulfate 
records from Greenland and Antarctica. We base the claim that 
these two eruptions were tropical in origin and caused large radiative 
perturbations on the observation that ice cores from Greenland and 
Antarctica record coeval (within their respective age uncertainties) 
and exceptionally high volcanic sulfate concentrations. Both of these 
events were followed by strong and persistent growth reduction in 
tree-ring records™ (Fig. 2) as is typically observed after large tropical 
eruptions during the Common Era (Fig. 3). 


Post-volcanic summer cooling 

Superposed epoch analyses (Methods) performed on the ‘N-Tree’ 
composite record centred on the largest volcanic signals between 
5008BcE and 1250 cE as well as between 1250 cE and 2000 cE, show a 
clear post-volcanic cooling signal. For both periods, maximum tree- 
ring response lagged the date of initial increase of sulfate deposition by 
one year (Fig. 2), consistent with the response observed if using only 
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eruption, are shown. Dashed lines present twice 
the standard error of the mean (2s.e.m.) of the 
temperature anomalies associated with the 
multiple eruptions. Five-year average post-volcanic 
temperatures are shown for each reconstruction 
(lag 0 to lag +4 years, grey shading). 


historically documented eruptions with secure dating for the past 800 
years'®. The sharp and immediate (that is, less than one year lag time) 
response of tree growth to the ice-core volcanic signal throughout the 
past 2,500 years further corroborates the accuracy of our new ice-core 
timescales (Extended Data Fig. 4). 

Of the 16 most negative tree-growth anomalies (that is, the coldest 
summers) between 5008cE and 1000 cE, 15 followed large volcanic 
signals—with the four coldest (43 BCE, 536 CE,543 CE, and 627 CE) 
occurring shortly after several of the largest events (Extended Data 
Tables 4 and 5). Similarly, the coldest summers in Europe during 
the Common Era’ were associated with large volcanic eruptions 
(Extended Data Table 5). Reduced tree growth after volcanic 
eruptions was also prominent in decadal averages of the ‘N-Tree’ 
composite. All 16 decades with the most reduced tree growth for 
our 2,500-year period followed large eruptions (Fig. 3, Extended 
Data Table 5). In many cases, such as the coldest decade, from 
536 CE to 545 cE’, sustained cooling was associated with the combined 
effect of several successive volcanic eruptions. 

Strong post-volcanic cooling was not restricted to tropical erup- 
tions; it also followed Northern Hemisphere eruptions (Fig. 4), with 
maximum cooling in the year of volcanic-sulfate deposition. In con- 
trast to the average of the 19 largest CE tropical eruptions, however, the 
Northern-Hemisphere-only eruptions did not give rise to any notice- 
able long-term cooling effect (Fig. 4). The persistence of implied post- 
volcanic cooling following the largest tropical eruptions is strongly 
expressed in temperature reconstructions based on tree-ring width 
measurements (for example, those from the Alps), with recovery 
times of more than ten years. Persistent cooling, with temperature 
reduction notably below the pre-eruption baseline for six consecutive 
years, is also observed in temperature reconstructions based on 
maximum latewood density (for example, those from Northern 
Scandinavia), which is the preferred proxy with which to quantify 
volcanic cooling contributions on climate owing to its less marked 
biological memory effects* (Fig. 4). These findings indicate that erup- 
tion-induced climate anomalies following large tropical eruptions 
may last longer than is indicated in many climate simulations 
(<3-5 years)****’ and that potential positive feedbacks initiated after 


ARTICLE 


large tropical eruptions (for example, sea-ice feedbacks) may not be 
adequately represented in climate simulations*”’. 

The five-year averaged (lag 0 to lag 4 years) cooling response 
over three Northern Hemisphere regions (Methods) following the 
19 largest Common Era tropical eruptions was —0.6 + 0.2 °C (two 
standard errors of the mean, 2 s.e.m.), and that of large Northern 
Hemisphere eruptions was —0.4 + 0.4 °C, with the strongest cooling 
induced in the high latitudes. Overall, cooling was proportional to the 
magnitude of volcanic forcing, with stratospheric sulfate loading 
exceeding that of the Tambora eruption inducing the strongest res- 
ponse of —1.1 + 0.6 °C (Figs 3 and 4). 


Global climate anomalies in 536-550 CE 


Our new dating allowed us to clarify long-standing debates concern- 
ing the origin and consequences of the severe and apparently global 
climate anomalies observed in the period 536-550 cE, which began 
with the recognition of the “mystery cloud” of 536 cE” observed in the 
Mediterranean basin. Under previous ice-core dating, it has been 
argued that this dust veil corresponded to an unknown tropical erup- 
tion dated 533-534 cE (+2 years)“. Using our revised timescales, we 
found at least two large volcanic eruptions around this period (Fig. 5). 
The first eruptive episode in 535 cE or early 536 CE injected large 
amounts of sulfate and ash into the atmosphere, apparently in the 
Northern Hemisphere. Geochemistry of tephra filtered from the 
NEEM-2011-S1 ice core at a depth corresponding to 536 CE indicated 
multiple North American volcanoes as likely candidates for a com- 
bined volcanic signal (Extended Data Fig. 5, Methods, Supplementary 
Data 5). Historical observations (Extended Data Table 3) identified 
atmospheric dimming as early as 24 March 536 cE, and lasting up to 
18 months. The summer of 536 CE appeared exceptionally cold in all 
tree-ring reconstructions in the extra-tropical Northern Hemisphere 
from North America™, over Europe***** to Asia“*. Depending on the 
reconstruction method used, European summer temperatures in 
536 CE dropped 1.6-2.5 °C relative to the previous 30-year average’. 
The second eruptive episode in 539 cE or 540 CE, identified in both 
Greenland and Antarctica ice-core records and hence probably trop- 
ical in origin, resulted in up to 10% higher global aerosol loading than 
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the Tambora 1815 eruption reconstructed from our bipolar sulfate 
records. Summer temperatures consequently dropped again, by 
1.4-2.7 °C in Europe in 541 ce’, and cold temperatures persisted in 
the Northern Hemisphere until almost 550 cr***** (Figs 2, 3, 5). 

This provides a notable environmental context to widespread fam- 
ine and the great Justinian Plague of 541-543 cE that was responsible 
for decimating populations in the Mediterranean and potentially 
China*°. Although certain climatic conditions (for example, wet 
summers) have been linked to plague outbreaks in the past’, a direct 
causal connection of these two large volcanic episodes and subsequent 
cooling to crop failures and outbreaks of famines and plagues is dif- 
ficult to prove*’. However, the exact delineation of two of the largest 
volcanic signals—with exceptionally strong and prolonged Northern 
Hemisphere cooling, written evidence of famines and pandemics, 
as well as the socio-economic decline observed in Mesoamerica 
(the “Maya Hiatus”**), Europe, and Asia—supports the idea that 
the latter may be causally associated with volcanically induced 
climatic extremes. 

Detailed study of major volcanic events during the sixth century 
(Fig. 5) and an assessment of post-volcanic cooling throughout the 
past 2,500 years using stacked tree-ring records and regional temper- 
ature reconstructions (Fig. 4, Extended Data Fig. 4) demonstrated that 
large eruptions in the tropics and high latitudes were primary drivers 
of interannual-to-decadal Northern Hemisphere temperature vari- 
ability. The new ice-core chronologies imply that previous multi- 
proxy reconstructions of temperature that include ice-core records** 
have diminished high- to mid-frequency amplitudes and must be 
updated to accurately capture the timing and full amplitude of palaeo- 
climatic variability. 

By creating a volcanic forcing index independent of but consistent 
with tree-ring-indicated cooling, we provide an essential step towards 
understanding of external forcing on natural climate variability dur- 
ing the past 2,500 years. With the expected detection of additional 
rapid AC enrichment events from ongoing efforts in annual-reso- 
lution '“C tree-ring analyses“, there will be opportunities to further 
constrain ice-core dating throughout the Holocene and develop a 
framework of precisely dated, globally synchronized proxies of past 
climate variability and external climate forcing. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


Ice cores. This study included new and previously described ice-core records 
from five drilling sites (Extended Data Fig. 1, Supplementary Data 1). The 
upper 577 m of the 3,405-m WAIS Divide (WDC) core from central West 
Antarctica and a 410-m intermediate-length core (NEEM-2011-S1) drilled in 
2011 close to the 2,540-m North Greenland Eemian Ice Drilling (NEEM)” ice 
core have previously been used to reconstruct sulfate deposition in both polar 
ice sheets'®. These coring sites are characterized by relatively high snowfall 
(~200kgm™~*yr~') and have comparable elevation, latitude, and deposition 
regimes. WDC and NEEM-2011-S1 provided high-resolution records that 
allowed annual-layer dating based on seasonally varying impurity content'®. 
New ice-core analyses included the upper 514m of the main NEEM core used 
to extend the record of NEEM-2011-S1 to cover the past 2,500 years, as well as 
B40 drilled in 2012 in Dronning Maud Land in East Antarctica and TUNU2013 
drilled in 2013 in Northeast Greenland—both characterized by lower snowfall 
rates (~70-100 kg m” yr 1) Volcanic sulfate concentration from B40 had been 
reported previously for the past 2,000 years'’, but we extended measurements to 
200 m depth to cover the past 2,500 years. 
High-resolution, ice-core aerosol analyses. Ice-core analyses were performed at 
the Desert Research Institute (DRI) using 55-100-cm-long, longitudinal ice-core 
sections (33 mm X 33 mm wide). The analytical system for continuous analysis 
included two Element2 (Thermo Scientific) high-resolution inductively coupled 
plasma mass spectrometers (HR-ICP-MS) operating in parallel for measurement 
of a broad range of ~35 elements; an SP2 (Droplet Measurement Technologies) 
instrument for black carbon measurements; and a host of fluorimeters and spec- 
trophotometers for ammonium (NH,*), nitrate (NO; ), hydrogen peroxide 
(H,O ), and other chemical species. All measurements were exactly co-registered 
in depth, with depth resolution typically less than 10-15 mm*'~*?. We corrected 
total sulfur (S) concentrations for the sea-salt $ contribution using sea-salt Na 
concentrations’®. Measurements included TUNU2013 and NEEM (400-515 m) 
in Greenland, and B40 in Antarctica (Extended Data Fig. 1). Gaps (that is, ice not 
allocated to DRI) in the high-resolution sulfur data of the NEEM core were filled 
with ~4-cm-resolution discrete sulfate measurements using fast ion-chromato- 
graphy techniques” performed in the field between 428 m and 506 m depth. 
Independent analyses of the upper part of the NEEM main core were per- 
formed in the field using a continuous flow analysis (CFA) system* recently 
modified to include a new melter head design®’. Ca*, NH,*, and HO, were 
analysed by fluorescence spectroscopy; Na* and NO; _ by absorption spectro- 
scopy; conductivity of the meltwater by a micro flow cell (Amber Science); and a 
particle detector (Abakus, Klotz) was used for measuring insoluble dust particle 
concentrations and size distribution”. Effective depth resolution was typically 
better than 20 mm. Measurements were exactly synchronized in depth using a 
multicomponent standard solution; the accuracy of the depth assignment for all 
measurements was typically better than 5 mm. 
High-resolution measurements of '°Be in ice cores using AMS. Accelerator 
mass spectrometry (AMS) was used to analyse samples from the NEEM-2011-S1, 
WDC, NGRIP, and TUNU2013 ice cores encompassing the time period of 
the A'*C anomalies from tree-ring records'*”?5 were used for '°Be analysis 
(Supplementary Data 1). NEEM-2011-S1 and WDC were sampled in exact 
annual resolution, using the maxima (minima in WDC) of the annual cycles of 
Na concentrations to define the beginning of the calendar year'®. NGRIP was 
sampled at a constant resolution of 18.3 cm, providing an age resolution of about 
one year. Similarly, TUNU2013 was sampled in quasi-annual resolution accord- 
ing to the average annual-layer thickness expected at this depth based on prior 
volcanic synchronization to NEEM-2011-S1. The relative age uncertainty for 
TUNU2013 with respect to the dependent NEEM-2011-S1 chronology at this 
depth is assumed to be +1 year at most, given a distinctive match for selected 
volcanic trace elements in both ice-core records (752-764 cE, NS1-2011 time- 
scale). Sample masses ranged between 100 g and 450 g, resulting in median overall 
quantification uncertainties of less than 4%-7%. The '°Be/*Be ratios of samples 
and blanks were measured relative to well documented '’Be standards? by AMS 
at Purdue’s PRIME laboratory (WDC, NEEM-2011-S1, Tunu2013) and Uppsala 
University (NGRIP)**°*. Results were corrected for an average blank '°Be/*Be 
ratio, corresponding to corrections of 2%-10% of the measured '°Be/*Be ratios. 
Annual-layer dating using the StratiCounter algorithm. For annual-layer 
interpretation, we used DRI’s broad-spectrum aerosol concentration data from 
WDC (188-577 m), NEEM-2011-S1 (183-411 m), and NEEM (410-515 m), as 
well as NEEM aerosol concentration data (183-514m) from the field-based 
CFA system. The original timescale for NEEM-2011-S1 was based on volcanic 
synchronization to the NGRIP sulfate record on the GICCO05 timescale and 
annual-layer interpretation between the volcanic age markers, whereas WDC 
was previously dated by annual-layer counting’®. 


Parameters with strong intra-annual variability included tracers of sea salt (for 
example, Na, Cl, Sr), dust (for example, Ce, Mg, insoluble particle concentration), 
and marine biogenic emissions such as non-sea-salt sulfur (nssS). Tracers of 
biomass-burning emissions, such as BC, NH,*, and NO,_, also showed strong 
seasonal variations in deposition during pre-industrial times'®*°*'. The data sets 
used for annual-layer interpretation are provided in Extended Data Table 1. For 
NEEM-2011-S1, the final database used for annual-layer dating included 13 
parameters and the ratio of nssS/Na. For WDC, the final database included five 
parameters and the ratio of nssS/Na. For NEEM (410-515 m depth), the final 
database included eight parameters (Na‘*, Ca?*, NH, *, H,0,, NO; _, conduc- 
tivity, insoluble particle concentrations, and electrical conductivity®’) from the 
field-based measurements and eleven parameters (Na, Cl, Mg, Mn, Sr, nssS, nssS/ 
Na, nssCa, black carbon, NO; , NH,") from the DRI system. 

We focused here on the time period before the large volcanic eruption of 
Samalas in 1257 cE”, clearly detectable as an acidic peak in both ice-core records, 
and consequently started annual-layer counting of NEEM-2011-S1, NEEM, and 
WDC at the depth of the corresponding sulfur signal. For the time period 1257 cE 
to present, ice-core chronologies were constrained by numerous historic erup- 
tions and large sulfate peaks, showing a strong association to Northern 
Hemisphere cooling events as indicated by tree-ring records’®. 

We applied the StratiCounter layer-detection algorithm™ to the multi-para- 
meter aerosol concentration records (n = 14 for NEEM-2011-S1; n = 6 for WDC; 
n= 8 for NEEM < 410 m; n = 19 for NEEM > 410 m) to objectively determine 
the most likely number of annual layers in the ice cores along with corresponding 
uncertainties. The StratiCounter algorithm is based on statistical inference in 
Hidden Markov Models (HMMs), and it determines the maximum-likelihood 
solution based on the annual signal in all aerosol records in parallel. Some of these 
displayed a high degree of similarity, so we weighted these records correspond- 
ingly lower. The algorithm was run step-wise down the core, each batch covering 
approximately 50 years, with a slight overlap. All parameters for the statistical 
description of a mean layer and its inter-annual variability in the various aerosol 
records were determined independently for each batch as the maximum-like- 
lihood solution. The algorithm simultaneously computes confidence intervals for 
the number of layers within given sections, allowing us to provide uncertainty 
bounds on the number of layers between selected age-marker horizons (Extended 
Data Table 2). 

Annual-layer detection in the NEEM main core below 410 m was made more 

difficult by frequent occurrence of small gaps in the two independent high-reso- 
lution aerosol data sets. Depending on the parameter, data gaps from the CFA 
field measurements accounted for up to 20% of the depth range between 410 m 
and 515m, but the combined aerosol records from both analyses provided an 
almost complete aerosol record with 96% data coverage. As this was the first time 
that the StratiCounter algorithm was used simultaneously on data records from 
two different melt systems, with different characteristics and lack of exact co- 
registration, we also manually determined annual layers below 410 m using the 
following approaches: one investigator used Na and nssCa concentrations and the 
ratio of nssS/Na (from DRI analysis) as well as Na‘ and insoluble particle con- 
centrations (from CFA analysis) as primary dating parameters. Black carbon, 
NH,", nssS, and conductivity were used as secondary dating parameters where 
annual-layer interpretation was ambiguous. A second investigator used DRI’s Na, 
Ca, BC, NH,* and CFA Na‘, Ca?*, and NH,* measurements as parameters. The 
annual-layer interpretation of the NEEM core between 410m and 514m from 
investigator 1 was within the interpretation uncertainties of the StratiCounter 
output, from which it differed by less than a single year over the majority of this 
section, and it differed from independently counted timescales (for example, 
GICC05)° by on average less than three years (Extended Data Fig. 2). This set 
of layer counts was used for the resulting timescale. 
New ice-core chronologies for NS1-2011 and WD2014. We defined the depth of 
NEEM-2011-S1 containing the maximum '°Be concentration as the year 775 CE. 
Relative to this constraint, the maximum-likelihood ages for three large volcanic 
sulfate peaks were within one year of documented historical reports from early 
written sources of prominent and sustained atmospheric dimming observed in 
Europe and/or the Near East (Extended Data Table 3, Supplementary Data 2). 
Automated-layer identification for NEEM-2011-S1 was therefore constrained by 
tying the respective ice-core volcanic signals to the corresponding absolute his- 
torically dated ages of 536 CE, 626cE, and 939 ck (Extended Data Table 2)— 
thereby creating a new ice-core timescale (NS1-2011). The volcanic sulfur signal 
corresponding to the eruption of Samalas believed to have occurred in late 12577" 
was constrained to 1258 CE to account for several months’ delay in sulfate depos- 
ition in the high latitudes. Before 86 CE (the bottom depth of NEEM-2011-S1), the 
NS1-2011 timescale was extended using the manually derived annual-layer inter- 
pretation of the combined NEEM aerosol data sets back to 500 BcE (Fig. 2). 
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In NS1-2011 we did not attribute acid layers to the historical eruptions Vesuvius 
79 and Hekla 1104, due toa lack of corroborative tephra at these depths in this anda 
previous study*’. Possible Vesuvian tephra was reported from the Greenland Ice 
Sheet Project (GRIP) ice core at 429.3 m depth™, but in view of the new annual- 
layer dating results (Extended Data Fig. 3), we concluded that this layer dates to 
87/88 cE. Furthermore, volcanic sulfate deposition values for the corresponding 
event show a strong spatial gradient over Greenland with highest values in north- 
west Greenland'® and lowest in central and south Greenland®, favouring the 
attribution of a volcanic source from the high latitudes. Documentary sources 
(Supplementary Data 2) also suggest that the main vector of ash transport following 
the Vesuvius 79 cE eruption was towards the eastern Mediterranean”. 

For WDC, we do not have other sufficiently well determined age constraints 
besides the rapid ‘°Be increase in 775 cE and the sulfur signal of the Samalas 1257 
eruption. Therefore, no additional constraints were used when creating the new 
ice-core timescale (“WD2014”) from the StratiCounter annual-layer interpreta- 
tion back to 396 BCE. 

Depth-age information for six distinctive marker horizons in Greenland is 
given, and five of these horizons were used to constrain NS1-2011 (Extended 
Data Table 3). Similarly, depth information, the number of annual layers, and 
95% confidence intervals between distinctive volcanic marker horizons are given 
for NEEM, NEEM-2011-S1, and WDC, supporting attribution of these ice-core 
signals to eruptions in the low latitudes with bipolar sulfate deposition. 
Evaluation of NS1-2011 using independent age information. We evaluated 
timescale accuracy using additional distinctive age markers not used during 
chronology development: 

(1) Tephra from the eruption of Changbaishan/Tianchi (China) was detected 
in NEEM-2011-S1 in 946-947 cE, in agreement with widespread documentary 
evidence of an eruption in that region in winter 946/47 cE also supported by a 
high-precision ‘*C wiggle-match age of 946 + 3cE obtained from a tree killed 
during this eruption™. 

(2) The rapid increase of 1°Be from the 994 cE event occurred in NEEM-2011- 
S1 in 993 cE, consistent with A’*C from Japanese tree rings showing that the 
rapid increase in radionuclide production took place between the Northern 
Hemisphere growing seasons of 993 cE and 994 cE”. 

(3) To assess the accuracy of the NS1-2011 timescale before the earliest age 
marker at 536 CE, we compiled an independent time series of validation points, 
featuring years with well dated historical reports of atmospheric phenomena 
associated with high-altitude volcanic dust and/or aerosols (Supplementary 
Data 2) as known from modern observations to occur after major eruptions 
(for example, the Krakatau eruption of 1883). These phenomena include dimin- 
ished sunlight, discoloration of the solar disk, solar coronae (that is, Bishop’s 
Rings), and deeply red twilights (that is, volcanic sunsets)®”°. Thirty-two events 
met our criteria as validation points for the pre-536 CE NS1-2011 timescale. For 
the earliest in 255 BCE, it was reported in Babylon that “the disk of the sun looked 
like that of the moon””’. For the latest in 501 cE, it was reported in North China 
that “the Sun was red and without brilliance””’. We found that NEEM volcanic 
event years (including both NEEM and NEEM-2011-S1 data) occurred closely in 
time (that is, within a conservative +3-year margin) to 24 (75.0%) of our valid- 
ation points (Extended Data Fig. 2). To assess whether this association arose 
solely by chance, we conducted a Monte Carlo equal means test with 1,000,000 
iterations (Supplementary Data 2) and found that the number of volcanic event 
years within three years of our validation points was significantly greater than 
expected randomly (P<0.001). A significant association was also observed 
(P< 0.001) when using less conservative error margins (+1 and +2 years) and 
when excluding any historical observations with less certainty of a volcanic origin 
(Supplementary Data 2). When placing volcanic event years on the original 
GICCO05 timescale, we did not observe any statistically significant association 
with our independent validation points. 

Potential causes of a previous ice-core dating bias. Interpretation of annual 
layers in ice cores is subject to accumulating age uncertainty due to ambiguities in 
the underlying ice-core profiles’. Bias in existing chronologies may arise from 
several factors, including: (1) low effective resolution of some ice-core measure- 
ments (NGRIP, GRIP); (2) use of only single (or few) parameters for annual-layer 
interpretation (GRIP, Dye-3 ice cores); (3) intra-annual variations in various ice- 
core parameters falsely interpreted as layer boundaries (for example, caused by 
summer melt in Dye-3)”; (4) use of tephra believed to originate from the 79 CE 
Vesuvian eruption™ as a fixed reference horizon to constrain the Greenland ice- 
core dating”; (5) use of manual-layer interpretation techniques that may favour 
interpretations consistent with a priori knowledge or existing chronologies 
(WDC)!82!, 

Volcanic synchronization of B40, TUNU2013, and NGRIP. Two high-resolution 
sulfur ice-core records (TUNU2013, Greenland and B40, Antarctica) were syn- 
chronized to NEEM-2011-S1 and WDC, respectively, using volcanic stratigraphic 
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age markers’’ with relative age uncertainty between the tie-points estimated 
to not exceed +2 years. The NGRIP sulfate record measured at 5 cm depth 
resolution’® similarly was synchronized to NS1-2011 using 124 volcanic tie- 
points between 226 and 1999 cE. During the time period with no sulfur record 
yet available for WDC (before 396 BCE), a tentative chronology for B40 was 
derived by linearly extrapolating mean annual-layer thickness for B40 as derived 
from the synchronization to WDC between the earliest volcanic match points. 
2,500 year global volcanic forcing ice-core index. We constructed an index of 
global volcanic aerosol forcing by (1) re-dating and extending to 500 BCE an 
existing reconstruction of sulfate flux from an Antarctic ice-core array’’ by apply- 
ing an area weighting of 80/20 between East Antarctica and West Antarctica 
to B40 and WDC volcanic sulfate flux values, respectively; (2) compositing 
NGRIP and the NEEM-2011-S1/NEEM sulfate flux records to a similar 
Greenland sulfate deposition composite back to 500 BcE; (3) using established 
scaling functions®”* to estimate hemispheric sulfate aerosol loading from both 
polar ice-core composites; and (4) scaling global aerosol loading to the total (that 
is, time-integrated) radiative volcanic aerosol forcing following the Tambora 
1815 eruption’. Since the NS1-2011 and WD2014 timescales are independent 
of each other, the timing of bipolar events had to be adjusted to follow a single 
timescale to derive a unified global volcanic forcing series. We chose NS1-2011 as 
the reference chronology for most of the volcanic time series because this age 
model was constrained and validated by more stratigraphic age markers than 
WD2014. WD2014 was used as the reference chronology only between 150 cE 
and 450 cE, because of better data quality during that time period. TUNU2013 
was not included in the Greenland ice-core composite because annual-layer 
thickness variability at this site is influenced strongly by glaciological processes, 
leading to relatively large uncertainties in atmospheric sulfur-deposition deter- 
minations. 

Northern Hemisphere tree-ring composite. Tree-ring records from certain 
locations reflect summer cooling (as is widely observed after volcanic eruptions) 
with no age uncertainty in annual ring-width dating, thus allowing independent 
validation of ice-core timescales and the derived volcanic forcing indices. 
However, no tree-ring-based temperature reconstructions of large spatial scales 
span the full 2,500 years represented by our new ice-core chronologies. To thus 
evaluate our new ice-core chronologies and assess the consistency of response 
throughout the past 2,500 years, we compiled a composite (entitled “N-Tree’) of 
multi-centennial tree growth records at locations where temperature is the lim- 
iting growth factor. We selected available Northern Hemisphere tree-ring records 
that provided a continuous record of >1,500 years and showed a significant 
positive relationship with JJA temperatures during the instrumental period 
(1901-2000 ce) with P<0.005 (adjusted for a reduced sample size owing to 
autocorrelation of the data sets). In total, five tree-ring chronologies (three based 
on ring-width measurements, two based on measurements of maximum late- 
wood density) met these criteria****’°”* of which three are located in the high 
latitudes of Eurasia (Extended Data Fig. 1). 

As various climatic and non-climatic parameters may influence sensitivity of 
tree growth to temperatures during the twentieth century” “’, we used the time 
period 1000-1099 cE as a common baseline for standardizing tree growth anom- 
alies among the five chronologies and built a tree growth composite record called 
N-Tree (z-scores) by averaging the individual records. Correlations between 
N-Tree (N=5) and the average of three regional reconstructions for the 
Arctic, Europe, and Asia (N > 275)* between 1800 cE and 2000 cz are very high 
(r = 0.86, N = 201, P< 0.0001), suggesting that much of the large-scale variation 
in temperature is explained by these selected tree-ring records. Three records in 
N-Tree cover the period from 138 BCE to the present, thus allowing at least a 
qualitative assessment of the coherence of growth reduction following large vol- 
canic eruptions before the Common Era (Fig. 2, Extended Data Fig. 4). 
Temperature reconstructions. To quantify the Common Era climate impact and 
investigate regional differences, we used tree-ring-based JJA temperature recon- 
structions covering the past 2,000 years with a demonstrated strong relationship 
(r= 0.45; P< 0.0001; Extended Data Fig. 1) to instrumental JJA temperature 
data*’ between 1901 and 2000. For regions where this criterion was met by several 
reconstructions (for example, Scandinavia), we limited the analysis to the most 
recently updated reconstruction’. Three regional reconstructions from Central 
Europe”, Northern Europe”, and Northern Siberia (Yamal, not shown)” as well 
as a continental-scale reconstruction for Europe’ met this criterion and were used 
to quantify the average response of summer temperature to volcanic forcing 
during the Common Era (Figs 3 and 4). 

Superposed epoch analyses. To assess tree-ring growth reduction and summer 
cooling following large eruptions, we used superposed epoch analyses****. 
We selected all volcanic eruptions (28 events in total, 24 cE events) with time- 
integrated volcanic forcing greater than —7.5 Wm ~ (that is, eruptions larger 
than Pinatubo 1991) and aligned the individual segments of N-Tree and regional 
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JJA temperature reconstructions relative to ice-core-indicated peak forcing. The 
composite response was calculated for the average of the individual series (lag 0 to 
lag 10 or 15 years) relative to the average values five years before individual 
volcanic events (lag —5 to lag —1 year). 95% confidence intervals represent 2 
s.e.m. of the tree-growth (Extended Data Fig. 4) and temperature anomalies 
(Fig. 4) associated with the multiple eruptions. 

Cryptotephra analyses of the 536 cE sample from NEEM-2011-S1. We analysed 
samples from NEEM-2011-S1 for tephra between 326.73 m and 328.06 m depth, 
corresponding to 531-539 cE (NS1-2011 timescale). Samples (200g to 500g) 
were filtered, and elemental composition of recovered volcanic glass shards 
determined by electron microprobe analysis at Queen’s University Belfast 
using established protocols**”** and secondary glass standards***’. Between 
326.73 m and 327.25 m, large volume samples were cut at 8 cm depth resolution 
($0.5 years) and with an average cross-section of 26 cm’. Between 327.25 m and 
328.06 m, the average cross-section was 7 cm” and depth resolution 20 cm (~1 yr 
resolution). Tephra particles (n = 17) were isolated from a sample of ice (327.17- 
327.25 m depth, 251 g) corresponding to the sulfate spike at 536 cE. The glass 
shards were heterogeneous in size (20-80 tm), morphology (platey, blocky, vesi- 
cular, microlitic), and geochemistry (andesitic, trachytic, rhyolitic). Individual 
shards had geochemical compositions that share affinities with volcanic systems 
in the Aleutian arc (Alaska)**, Northern Cordilleran volcanic province (British 
Columbia), and Mono-Inyo Craters area (California)””’'—indicating at least 
three synchronous eruptive events, all situated in western North America 
between 38 °N and 58 °N (Extended Data Fig. 5; Supplementary Data 5). 

Data and code availability. Ice-core data (chemistry, including sulphur and 
1Be), the resulting timescales, and the volcanic forcing reconstruction are 
provided as Supplementary Data 1, and 3-5). Historical documentary data are 
provided as Supplementary Data 2. The code for the StratiCounter program 
is accessible at the github repository (http://www.github.com/maiwinstrup/ 
StratiCounter). NGRIP SO, data can be obtained at http://www.iceandclimate. 
nbi.ku.dk/data/2012-12-03_NGRIP_SO4_5cm_Plummet_et_al_CP_2012.txt. 
Tree-ring records and temperature reconstructions are from the Supplementary 
Database S1 and S2 of the Pages-2k Consortium (ref. 3; http://www.nature.com/ 
ngeo/journal/v6/n5/full/ngeo1797.html#supplementary-information). 
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Extended Data Figure 1 | Location of study sites. a, Map showing locations 
(blue circles) of the five ice cores (WDC, B40, NEEM, NGRIP and TUNU) 
used in this study. Sites of temperature-limited tree-ring chronologies 
(green)?”*’*’8 and sites with annual AMC measurements from tree-rings in 
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Extended Data Figure 2 | Volcanic dust veils from historical documentary 
sources in relation to NEEM. Time series of 32 independently selected 
chronological validation points from well dated historical observations of 
atmospheric phenomena with known association to explosive volcanism (for 
example, diminished sunlight, discoloured solar disk, solar corona or Bishop’s 
Ring, red volcanic sunset) as reported in the Near East, Mediterranean 
region, and China, before our earliest chronological age marker at 536 cE. Black 
lines represent the magnitude (scale on y axes) of annual sulfate deposition 


measured in NEEM (NEEM and NEEM-2011-S1 ice cores) from explosive 
volcanic events on the new NS1-2011 timescale. Red crosses depict the 24 (75%) 
historical validation points for which NEEM volcanic events occur within a 
conservative +3-year uncertainty margin. Blue crosses represent the eight 
points for which volcanic events are not observed. The association between 
validation points and volcanic events is statistically significantly non-random 
at>99.9% confidence (P < 0.001). ppb, parts per billion. 
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Extended Data Figure 3 | Timescale comparison. Age differences of the 
timescales NS1-2011 and GICC05 for the NEEM-2011-S1/NEEM ice cores 


(a) and WD2014 and WDC06A-7 for WDC (b). Differences before 86 cE (the 
age of the ice that is now at the bottom of the ice core NEEM-2011-S1) deriving 
from the annual-layer counting of the NEEM core are shown for major volcanic 


eruptions relative to the respective signals in NGRIP on the annual-layer 


counted GICC05 timescale. Marker events used for constraining the annual- 


layer dating (solid line) and for chronology evaluation (dashed lines) are 
indicated. Triangles mark volcanic signals. Also indicated is the difference 


between WD2014 and the Antarctic ice-core chronology (AICC2012)”, based 


on volcanic synchronization between the WDC and EDC96 ice cores. 
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Extended Data Figure 4 | Post-volcanic suppression of tree growth. multi-centennial tree-ring composite record (N-Tree) 15 years after the year 
Superposed epoch analysis for large volcanic eruptions using the 28 largest of volcanic sulfate deposition, relative to the average of five years before the 
volcanic eruptions (a); the 23 largest tropical eruptions (b); the five largest events. Dashed lines indicate 95% confidence intervals (2 s.e.m.) of the tree-ring 


Northern Hemisphere eruptions (c); and eruptions larger than Tambora 1815 — growth anomalies associated with the multiple eruptions. 
with respect to sulfate aerosol loading (d). Shown are growth anomalies of a 
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Extended Data Figure 5 | Major-element composition for ice core tephra 
QUB-1859 and reference material. Shown are selected geochemistry data: 
SiO, versus total alkali (K,O + Na2O) (a); FeO (total iron oxides) versus TiO, 
(b); SiOz versus Al,O3 (c); and CaO versus MgO (d) from 11 shards extracted 
from the NEEM-2011-S1 ice core at 327.17-327.25 m depth, representing 
the age range 536.0-536.4 CE on the new, NS1-2011 timescale. Data for Late 
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© Aniakchak reference data 
@ Mono Craters compiled data 
®@ Upper Finlay tephra (mean + 


1s.d.- see Suppl. Data) 
4 QUB-1859 Shards 1, 10 & 11 
“ QUB-1859 Shard 12 
4 QUB-1859 Shard 2 
4 QUB-1859 Shards 3, 4 & 13 
4 QUB-1859 Shards 5 & 7 
4A QUB-1859 Shard 8 
4 6 10 
CaO 


Holocene tephra from Mono Craters (California) are from the compilation by 
ref. 90; data for Aniakchak (Alaska) are from reference material published 

by ref. 88; and data for the early Holocene upper Finlay tephra, believed to be 
from the Edziza complex in the Upper Cordilleran Volcanic province (British 
Columbia), are from ref. 89. (See Supplementary Information for the Upper 
Finlay tephra.) 
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Extended Data Table 1 | Ice-core dating 


Ice Core 


NEEM-2011-S1, (Greenland), 183-411m 


NEEM, (Greenland), 183-410m 


NEEM, (Greenland), 410-514m 


Ice-core parameter 


Na, Cl 

nssS, nssS/Na 

nssCa, Mn, Ce 

BC, NH;’, BC geometric mean particle size 
Sr, Mg, |, NOs 


Ca™, particle count 
NH’, NOs 
conductivity, ECM, H2O> 


Na, Sr, Cl, Na” 

nssS, nssS/Na 

nssCa, Mn, Mg, particle count, Ca’* 
BC, NH,", NOs ,NH,", NOs 


Dominant aerosol source 


sea salt 

marine biogenic emissions 
dust 

biomass burning 

various 


sea salt 

dust 

biomass burning 
various 


sea salt 

marine biogenic emissions 
dust 

biomass burning 


Deposition maximum 


winter 
summer 
spring 
summer 


winter 
spring 
summer 


winter 
summer 
spring 
summer 


conductivity, ECM, H»O> various 

W0DCG, (Antarctica), 188-577m Na, Sr sea salt austral winter 
nssS, nssS/Na, Br marine biogenic emissions austral summer 
BC biomass burning austral summer/fall 

Year CE Depth (m) Event Parameter Independent age information 

NEEM-2011-S1 

1258 183.49 volcano (tropical) nssS ice core (GICCO5, Law Dome) 

goat 237.89 cosmic ray anomaly Be tree ring 

946 247.21 volcano (NH, Tianchi) nssS, tephra historical observation 

939. 248.87 volcano (NH, Eldgja) nssS historical observation 

775, 281.45 cosmic ray anomaly 1% tree ring 

626, 310.01 volcano (NH) nssS historical observation 

536 327.23 volcano (NH) nssS, tephra historical observation 

WoC 

1258 188.91 volcano (tropical) nssS ice core (GICCO5, Law Dome) 

775 303.36 cosmic ray anomaly *Be tree ring 


Parameters used for annual-layer interpretation. Parameters measured by the CFA system in the field are underlined. NH, Northern Hemisphere. 
*Stratigraphic age marker used to constrain annual-layer counting. 
+Horizons used to evaluate the timescale. 
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Extended Data Table 2 | Annual-layer results using the StratiCounter program 


Bipolar marker horizons 


Event" Depth (m) Annual layers between horizons 
[95% confidence interval] 
WDC NEEM-2011-S1 NEEM WDC NEEM-2011-S1 NEEM Weighted Year [2o]' 
average [20] (BCE/CE) 

Samalas 188.81 183.43 183.05 1259 [+2] 
775 Event 303.37 281.25 280.89 484 [481;487] 482 [479;485] 486 [483;490] 484 [+2] 775° 
UE 682 326.01 299.26 298.80 92 [91;93] 92 [91;93] 93 [(93;95] 92 [+1] 683 [+1] 
UE574 351.67 320.33 319.95 108 [108;108] 109 [108;110] 109 [107;111] 108 [+1] 575 [+2] 
UE5S40 360.03 326.61 326.25 34 [34;35] 36! [(35;37] 33 [33;34] 34 [+1] 541 [+2] 
UE 266 423.81 377.43 377.15 274 [272;277] 273 [270;276] 272 (271;274] 273 [+2] 268 [+3] 
UE-44 495.21 433.80 312 [310;315] 310[309;313]" 311 [+2] -44 [+4] 
UE -426 501.15 385 [383; 389)" 385 [24]" -429 [+6]" 


Greenland marker horizons 


Event Depth (m) Number annual layers [95% confidence] Maximum likelihood year Independent 
(wrt to 775 CE) Age (CE) 

NEEM-2011-S1 NEEM NEEM-2011-S1 NEEM NEEM-2011-S1 NEEM 

Samalas 183.43 183.05 1257 1261 1257/58* 

Eldgja 248.76 248.40 317 [315;319] 320 [318;323] 940 941 939% 

775 Event 281.25 280.89'  165[163;167] 166 [164;168] 775° 775° 775° 

UE 626/27 309.96 309.60 148[147;149] 150 [148;152] 627 625 626/627* 

UE 536/37 327.20 326.84 92! [91,94] 89 [88;90] 535 536 536% 

UE 87/88" 410.56 410.20 446 [443;449] 449 [446;452] 89 87 


Maximum-likelihood number of annual layers and confidence intervals derived from annual-layer counting between distinctive marker horizons and corresponding ages relative to the 775 cE 1Be event. wrt, with 
respect to. 
*Unattributed events (UE) give volcanic signal and year of sulfate deposition based on final age models. 

+Year calculated from the number of annual layers relative to the fixed age marker in 775 ce (negative numbers are years BCE). 

{Depth has been estimated from the average depth offset between NEEM-2011-S1 and NEEM. 

§Fixed age marker based on the !°Be maximum annual value. 

||Section with 6-m gap in the NEEM 2011-S1 core DRI data (this section is not used for calculating average age). 

This section is based on the NEEM field CFA data, since the DRI data does not cover the entire interval. 

#Section is based on combined data set of DRI and field-measured CFA data. The number of annual layers in this section from manual interpretation by investigator 1 was 383 (+7), and that of investigator 2 was 
393 (+8) layers. Most of the difference between the three layer counts occurred below 480 m (before 300 sce), where data gaps were more frequent. 

yvIndependent age markers used to constrain annual-layer dating in a second iteration to derive the final ice-core age model NS1-2011. 

**Tephra particles were extracted from the depth range 327.17-327.25 m (see Supplementary Data). 

++Unattributed volcanic signal that was previously attributed to the historic 79 ce eruption of Vesuvius. 
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Extended Data Table 3 | Historical documentary evidence for key volcanic eruption age markers 536-939 cE 


Year (Start) 


536 


626 


939 


Summary 


Diminished sunlight 
for >12 months 
(Mediterranean) 


Diminished sunlight 
for 9 months 
(Mediterranean, NW 
Europe) 


Diminished sunlight 
(Mediterranean, NW 
Europe) 


Translation 


For the sun gave forth its light without 
brightness, like the moon, during this whole 
year, and it seemed exceedingly like the sun 
in eclipse, for the beams it shed were not 
clear nor such as it is accustomed to shed. 


There was an eclipse of the sun and it lasted 
from October [626] until June [627], that is, 
for nine months. Half of its disc was eclipsed 
and the other half not; only a little of its 
light was visible. 


We observed the sun: it did not have any 
strength, brightness, nor heat. Indeed, we 
saw the sky and its colour changed, as if 
flushed. And others said that the sun was 
seen as if halved. 


Selected source(s) 


Procopius, History of the Wars, H.B. 
Dewing, trans. (Harvard, 1916), 4.14. 
(five additional sources) 


Theophilus of Edessa’s Chronicle, R.G. 


Hoyland, trans. (Liverpool, 2011), p. 
73. 
(five additional sources) 


Annales Casinates, Monumenta 
Germaniae Historica, Scriptores, ed. 
G.H. Pertz (Hannover, 1839), p. 172. 
Trans. for this paper by C. Kostick. 
(three additional sources) 


Confidence 


High; 

Eyewitness or 
contemporary with a 
reliable chronology. 


High; 

Eyewitness or 
contemporary but with 
some chronological 
uncertainty. 


High; 

Eyewitness or 
contemporary with a 
reliable chronology. 


A comprehensive list of sources, including translations and assessment of the confidence placed in each source and its chronological information is given in Supplementary Data. NW, northwest. 
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Extended Data Table 4 | Large volcanic eruptions during the past 2,500 years 


Rank 


ow on TD WH FSF Ww NY FB 


NN N NY NR Fe Be BP Be Be Be Be Be Be 
> WwW NY FP CO BO Oo NN DD UV F&F Ww NY KF CO 


25 


Year 


Vole. SOx” 
Greenland 
(kg km”) 
99.8 


90.4 
100.6 
39.0 
61.2 
39.7 
56.4 
135.8 
38.4 
38.3 
61.0 
34.6 
48.3 
44.2 
39.2 
39.1 
37.0 
99.0 
28.6 
88.7 
27.6 
20.6 
83.1 
27.9 
72.2 


Vole. SO,” 
Antarctica 
(kg km”) 
78.2 


73.4 
15.4 
63.6 
34.4 
45.8 
23.1 


Global 
forcin, 
f 


(Wm: 
-35.6 
-32.8 
-23.2 
-20.5 
-19.1 
-17.1 
-15.9 
-15.5 
-15.4 
-14.5 
-14.5 
-12.0 
-12.0 
-11.8 
-11.6 
-11.5 
-11.3 
-11.3 
-10.2 
-10.1 
-9.7 
-9.6 
-9.5 
-9.4 
-8.2 


Cold 
year 


-425 
1258 
-43 
1459 
541 
1816 
1230 
1783 
682 
574 
268 
1810 
1109 
1641 
1601 
170 
1171 
536 
1696 
940 
1288 
432 
87 
1346 
627 


N-Tree 
(z scores; 1000-99) 


-2.74 
-1.43 
-3.33 
-2.31 
-2.57 
-2.51 
-1.71 
-1.16 
-0.95 
-2.46 
-1.70 
-2.18 
-1.99 
-2.31 
-2.62 
-0.80 
-0.91 
-3.36 
-1.63 
-1.81 
-1.49 
-0.45 
-0.22 
-2.18 
-3.00 


Teurope/Arctic 
(Cc; 1961-90) 


1.74 
1.28 
“1.44 
0.65 
0.25 
0.49 
1.48 
0.93 
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Volcanot/ Region 


UE -426 
Samalas/Indonesia 
Chiltepe?/Nicaragua 
Kuwae/Vanuatu 
llopango?/El Salvador 
Tambora/Indonesia 
UE 1230 
Laki/Iceland 
Pago?/New Britain 
Rabaul ?/New Britain 
UE 266 

UE 1809 

UE 1108 

Parker /Philippines 
Huaynaputina/Peru 
UE 169 

UE 1171 

UE 536 

UE 1695 
Eldgja/Iceland 
Quilotoa?/Ecuador 
UE 433 

UE 87 

El Chichon?/Mexico 
UE 626 


Years with negative numbers are BcE. Tentative attribution of ice-core signals to historic volcanic eruptions is based on the Global Volcanism Program volcanic eruption database®*. Average (summer) temperature 
for the associated cold year is given for the average of Europe and the Arctic®. Volc., volcanic. 
*Total global aerosol forcing was estimated by scaling the total sulfate flux from both polar ice sheets to the reconstructed total (that is, time integrated) aerosol forcing for Tambora 1815’ (Methods); for high- 
latitude Northern Hemisphere eruptions, Greenland fluxes were scaled by a factor of 0.57°. 
+Unattributed volcanic events (UE) and tentative attributions for non-documented historic eruptions (?) are marked. 
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Extended Data Table 5 | Post-volcanic cooling 


JJAtemperature JJA temperature 

Rank Year anomaly (°C) Volcanic event(s) Decade anomaly (°C) Volcanic event(s) 
1 1821 -1.82 1815, 1821 1600-1609 -1.17 1595, 1601 
2 1601 -1.82 1600 536-545 -1.12 536, 540 
3 1675 -1.78 1673 1812-1821 -1.10 1809, 1815 
4 536 -1.67 536 1453-1462 -1.01 1453, 1458 
5 800 -1.66 800 1587-1596 -0.98 1585, 1590, 1595 
6 1816 -1.64 1815 1107-1116 -0.96 1108 1115 
7 1453 -1.57 1453 1344-1353 -0.87 1341, 1345 
8 1633 -1.56 351-360 -0.83 351, 358, 360 
9 1109 -1.56 1108 1692-1701 -0.82 1693, 1695 
10 1608 -1.53 413-422 -0.79 411, 418 
11 544 -1.50 540 1463-1472 -0.79 1458 1463, 1470 
12 574 -1.48 574 1127-1136 -0.79 1127 
13 1695 -1.47 1693, 1695 389-398 -0.78 388, 393 
14 543 -1.46 540 1672-1681 -0.77 1673 
15 541 -1.43 540 1632-1641 -0.76 1637, 1641 
16 549 -1.43 547 1258-1267 -0.76 1258 

Tree growth Tree growth 

anomaly anomaly 
Rank Year (z-scores) Volcanic event(s) Decade (z-scores) Volcanic event(s) 
1 536° 3.4 536 536-545 22 536, 540 
2 43 33 44 1812-1821 22 1809, 1815 
3 627° -3.0 626 1453-1462 -2.0 1453, 1458 
4 543 -3.0 540 -43 to-34 -2.0 -46,-44 
5 -360 -2.9 -360 1601-1610 -2.0 1600 
6 -35 -2.8 -35 -361 to -352 -18 -360,-356 
7 425° -2.7 ~426 1463-1472 -17 1458 1463 
8 42° -2.7 44 1832-1841 17 1831, 1835 
9 546 -2.6 540 1341-1350 17 1344 
10 -140° -2.6 -141 546-555 1.6 540, 547 
11 541 -2.6 540 1673-1682 15 1673 
12 544 -2.6 540 -427 to -418 15 426 
13 545 -25 540 1330-1339 14 1329, 1336 
14 574 -25 574 1638-1647 -14 1641 1646 
15 354° -25 -356 1699-1708 14 1695, 1708 
16 -38 -25 1285-1294 -14 1286 


Coldest years and decades (1-2000 ce, JJA temperature with respect to 1901-2000) for Europe* and years (500 BcE-1250 ce) and decades (500 Bce-2000 ce) with strong growth reduction in the N-Tree composite 
(with respect to 1000-1099). Ages of the volcanic events from the ice cores reflect the start of volcanic sulfate deposition in Greenland (NS1-2011 timescale) with the largest 40 events indicated in bold letters and 
tropical eruptions underlined. Years with negative numbers are before the Common Era (BCE). 

*Latewood frost ring in bristlecone pines within one year**. 
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Metabolic co-dependence gives rise to 
collective oscillations within biofilms 


Jintao Liu!, Arthur Prindle!*, J acqueline Humphries'*, Margal Gabalda- Sagarra**, Munehiro Asally**, Dong-yeon D. Lee!, 


San Ly!, Jordi Garcia-Ojalvo & Giirol M. Stiel! 


Cells that reside within a community can cooperate and also compete with each other for resources. It remains unclear 
how these opposing interactions are resolved at the population level. Here we investigate such an internal conflict within 
a microbial (Bacillus subtilis) biofilm community: cells in the biofilm periphery not only protect interior cells from 
external attack but also starve them through nutrient consumption. We discover that this conflict between protection 
and starvation is resolved through emergence of long-range metabolic co-dependence between peripheral and interior 
cells. As a result, biofilm growth halts periodically, increasing nutrient availability for the sheltered interior cells. We 
show that this collective oscillation in biofilm growth benefits the community in the event of a chemical attack. These 
findings indicate that oscillations support population-level conflict resolution by coordinating competing metabolic 
demands in space and time, suggesting new strategies to control biofilm growth. 


Cooperation and competition are complex social interactions that can 
have critical roles in biological communities. Cooperative behaviour 
often increases the overall fitness of the population through processes 
such as division of labour and production of common goods'™. At the 
same time, individuals in a community compete with each other for 
limited resources, such as nutrients**. Here we investigated bacterial 
biofilms’"° to determine how the conflict between the opposing social 
behaviours of cooperation and competition could be resolved at the 
community level to increase overall fitness. 

Biofilms typically form under environmental stress conditions, 
such as nutrient limitation'’”’. As these bacterial communities grow 
larger, the supply of nutrients to interior cells becomes limited due to 
an increase in nutrient consumption associated with the growth of 
multiple layers of cells in the biofilm periphery. Severe nutrient 
limitation for interior cells is detrimental to the colony, since the 
sheltered interior cells are critical to the survival of the biofilm 
community in the event of an external challenge. This defines a fun- 
damental conflict between the opposing demands for biofilm growth 
and maintaining the viability of protected (interior) cells (Fig. 1a). The 
identification of possible mechanisms that ensure the viability of the 
protected interior cells is fundamental to understanding biofilm 
development’*”>. 

To investigate directly how Bacillus subtilis biofilms continue 
expanding while sustaining interior cells, we converted the potentially 
complex three-dimensional problem to a simpler two-dimensional 
scenario using microfluidics. Specifically, we used growth cham- 
bers that are unconventionally large in the lateral, x-y dimensions 
(3 X 3 mm), while confining biofilm thickness (z dimension) to only a 
few micrometres (Fig. 1b). Therefore, biofilm expansion in this device 
is predominantly limited to two dimensions, creating a ‘pancake-like’ 
configuration. In fact, biofilms often form in confined aqueous envir- 
onments and thus this microfluidic chamber may better mimic those 
growth conditions’’"’. This experimental set-up is thus ideal to inter- 
rogate how biofilms can reconcile the opposing benefits of growth and 
protection during biofilm development. 


Oscillations in biofilm growth 


Unexpectedly, we observed oscillations in biofilm expansion despite 
constant media flow within the microfluidic device (Fig. 1c, d, 
Supplementary Video 1 and Extended Data Fig. 1a). Specifically, bio- 
films exhibit periodic reduction in colony expansion that is self- 
sustained and can last for more than a day (Fig. le and Extended 
Data Fig. 1b). The period of oscillations has a mean of 2.5 +0.8h 
(standard deviation (s.d.), 1 = 63 colonies), which is less than the 
duration of the average cell replication time of 3.4+0.2h (s.d., 
n=21 cell cycles) under this growth condition (Fig. 1f and 
Methods, “Data analysis’ section). Moreover, oscillations only arise 
when the biofilm exceeds a certain colony size (Supplementary Video 
2). In particular, quantitative measurements obtained from 53 indi- 
vidual biofilms indicate that oscillations emerge in colonies that 
exceed an average diameter of 580 + 85 jm (s.d., n = 53 colonies), 
which corresponds to approximately one million cells (Fig. 1g, h). 
Together, these data show that oscillations arise during biofilm 
formation and are self-sustained. 

Given that biofilms typically form under nutrient-limited condi- 
tions and bacterial growth is generally controlled by metabolism, we 
hypothesized that metabolic limitation has a key role in the observed 
periodic halting of biofilm expansion. In particular, after determining 
that carbon source limitation did not have an essential role in the 
oscillations (Extended Data Fig. 2), we focused on nitrogen limitation. 
The standard biofilm growth media (MSgg, see Methods, “Growth 
conditions’ section) used to study B. subtilis biofilm development 
contains glutamate as the only nitrogen source’®. In most organisms 
including B. subtilis, glutamate is combined with ammonium by glu- 
tamine synthetase (GS) to produce glutamine, which is essential for 
biomass production and growth (Fig. 2a)'’. Cells can obtain the 
necessary ammonium from glutamate through the enzymatic activity 
of glutamate dehydrogenase (GDH), expressed by the rocG or gudB 
genes in the undomesticated B. subtilis used in this study (Fig. 2a)'*°. 
To determine whether biofilms experience glutamine limitation, 
we measured expression of nasA, one of several genes activated in 


1Division of Biological Sciences, University of California San Diego, California 92093, USA. Department of Experimental and Health Sciences, Universitat Pompeu Fabra, 08003 Barcelona, Spain. °Warwick 
Integrative Synthetic Biology Centre, School of Life Sciences, University of Warwick, Coventry CV4 7AL, UK. 


*These authors contributed equally to this work. 
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Figure 1 | Biofilms grown in microfluidic devices show oscillations in 
colony expansion. a, Biofilms must reconcile opposing demands for 
protection from external challenges (gradient indicated in purple) and access to 
nutrients (gradient indicated in grey). b, Schematic of the microfluidic device 
used throughout this study. Direction of media flow is indicated by the blue 
arrows. c, Phase contrast image of a biofilm growing in the microfluidic 
device. The yellow arrow indicates the region of interest in panel d. d, Film strip 
of a radius of the biofilm over time shows a pause in colony expansion. This 
film strip represents one cycle of biofilm oscillations, indicated by the shaded 
region in panel e. Scale bar, 5 um. The arrowheads indicate direction of 
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biofilm growth. e, Growth rate over time shows persistent oscillations in colony 
expansion. f, Histogram of the average period of oscillations for each colony 
(n = 63 colonies, mean = 2.5 h, s.d. = 0.8 h). The cell replication time is 
approximately 3.4 h under these conditions (Methods, “Data analysis’ section). 
g. Growth rate as a function of colony diameter (which increases in time) shows 
that early colony growth does not exhibit oscillations. The orange line 
indicates the diameter (~600 jim) at which this colony initiates oscillations. 
h, Histogram of the diameter at which a colony begins to oscillate (n = 53 
colonies, mean = 576 uum, s.d. = 85 um). 
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Figure 2 | Biofilm growth depends specifically on extracellular ammonium 
availability. a, Colony growth in MSgg medium depends on the production of 
glutamine from externally supplied glutamate and self-produced or scavenged 
ammonium. Glutamine limitation was monitored using yellow fluorescent 
protein (YFP) expressed from the nasA promoter, which is activated upon 
glutamine limitation”’. b, Addition of 1 mM glutamine (blue shading) represses 
expression from the PnasA-YFP reporter (black), but does not affect expression 
from a constitutive reporter (Phyperspank-CFP + 1mM IPTG, grey). 

c, Growth area (see Methods, “Data analysis’ section) before and after addition 


30 
Time (h) 


35 40 


of 1 mM glutamine to an oscillating colony. d, Of the two nutrients required for 
glutamine production, externally supplied glutamate (green) is most abundant 
in the biofilm periphery, while biofilm-produced ammonium (red) is most 
abundant in the biofilm interior. e, Maximum intensity projection over one 
period of a colony oscillation, made from a difference movie (Methods, ‘Data 
analysis’ section), which shows regions of growth (white) and no growth 
(black). Scale bar, 100 jim. f, Growth area of an oscillating colony before and 
after addition of 30 mM glutamate (green shading). g, Growth area of an 
oscillating colony before and after addition of 1 mM ammonium (red shading). 
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response to a lack of glutamine”’. Results show that biofilms indeed 
experience glutamine limitation during growth. Specifically, supple- 
mentation of growth media directly with glutamine reduced nasA 
promoter expression, but did not affect expression of a constitutive 
promoter, confirming glutamine limitation within the biofilm 
(Fig. 2b). More strikingly, addition of exogenous glutamine elimi- 
nated periodic halting of biofilm growth (Fig. 2c and Extended Data 
Fig. 3a). These findings suggest that glutamine limitation plays a 
critical part in the observed oscillations during biofilm expansion. 

The synthesis of glutamine requires both glutamate and ammo- 
nium; therefore, we investigated which of these substrates could 
be responsible for the observed glutamine limitation. Glutamate is 
provided in the media and is thus readily available to cells in the 
periphery of the biofilm. However, consumption of glutamate by 
peripheral cells is likely to limit its availability to cells in the biofilm 
interior (Fig. 2d). One may thus expect that oscillations in biofilm 
expansion could be due to periodic pausing of cell growth in the 
biofilm interior. Accordingly, we set out to establish whether interior 
or peripheral cells exhibited changes in growth. By tracking physical 
movement within the biofilm, we uncovered that only peripheral 
cells grow, and that oscillations in biofilm expansion therefore arise 
exclusively from periodic halting of peripheral cell growth (Fig. 2e, 
Supplementary Video 3, Extended Data Fig. 4a and Methods, “Data 
analysis’ section). This finding was further confirmed by single-cell 
resolution analysis that directly showed periodic reduction in the 
growth of peripheral cells (Extended Data Fig. 4b). This surprising 
pausing of cell growth in the periphery, despite unrestricted access to 
glutamate, suggests that glutamate cannot be the limiting substrate for 
glutamine synthesis. Consistent with this expectation, biofilm oscilla- 
tions were not quenched by supplementation of the media with 
glutamate (Fig. 2f). Therefore, it is not glutamate but ammonium that 
appears to be the limiting substrate for glutamine synthesis in the 
biofilm periphery. 

Because cells can self-produce ammonium from glutamate, we next 
sought to determine how peripheral cells could experience periodic 
ammonium limitation despite a constant supply of glutamate in the 
media. It is well known that ammonium production is a highly regu- 
lated process that is dependent on the metabolic state of the cell and 
the ambient level of ammonium in the environment”. In particular, 
since ammonium is in equilibrium with ammonia vapour, which can 
freely cross the cell membrane and be lost to the extracellular media”’, 
the production of ammonium is known as a ‘futile cycle’. Cells there- 
fore preferentially use extracellular (ambient) ammonium for growth, 
rather than producing their own***. Since peripheral cells are 
exposed to media flow, they are particularly susceptible to this futile 
cycle of ammonia loss. In this sense, as ammonium is not provided in 
the media, even if all cells produce ammonium, the biofilm interior 
will be the major source for ambient ammonium (Fig. 2d). 
Consequently, the simplifying hypothesis is that growth of peripheral 
cells relies on ammonium produced within the biofilm. To test this 
conjecture, we supplemented the media with 1mM ammonium, 
which eliminated the periodic halting in biofilm expansion (Fig. 2g 
and Extended Data Figs 3b and 5a). When additional ammonium was 
suddenly removed from the media, growth in the biofilm periphery 
halted, as expected (Extended Data Fig. 5b). These findings indicate 
that peripheral cells preferentially rely on extracellular ammonium 
produced within the biofilm for their growth. 


Metabolic co-dependence within the biofilm 

The results described above evoke the possibility that ammonium 
limitation for peripheral cells may arise due to glutamate limitation 
for interior cells. Specifically, persistent consumption of glutamate by 
peripheral cells can deprive the interior cells of the necessary glutam- 
ate for ammonium production. To explore this nontrivial hypothesis, 
we turned to mathematical modelling to develop a conceptual frame- 
work and generate experimentally testable predictions. Our model 
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Figure 3 | Mathematical modelling of a spatial metabolic feedback loop 
gives rise to oscillations consistent with experimental data. a, The 
production of ammonium in the interior is limited by and at the same time 
triggers the consumption of glutamate in the periphery (green and red arrows, 
respectively), producing a delayed negative feedback loop. b, The excess 
glutamate not consumed by the biofilm periphery diffuses to the interior, where 
it can be converted into ammonium (green arrows). The ammonium in turn 
enhances growth in the periphery (red arrow) and consequently reduces the 
supply of glutamate to the interior. Model predictions are shown in 
c-h. ¢, Biofilm growth over time. d, Glutamate concentration over time. 
e, Ammonium concentration over time. f, Colony growth before and after 
glutamine addition (indicated by blue shading). g, Colony growth before and 
after addition of glutamate (green shading). h, Colony growth before and after 
addition of ammonium (red shading). 


describes separately the metabolic dynamics of interior and peripheral 
cells and the metabolite exchange between them, where the distinc- 
tion of the two subpopulations depends on nutrient availability 
(see Supplementary Information, “Mathematical Model’ section). 
The model thus consists of two main assumptions (Fig. 3a). First, 
consumption of glutamate during growth of peripheral cells deprives 
interior cells of this nutrient and thus inhibits ammonium production 
in the biofilm interior. Second, the growth of peripheral cells depends 
predominantly on ammonium that is produced by metabolically 
stressed interior cells. A model based on these two simplifying 
assumptions (Fig. 3b) generates oscillations consistent with our 
experimental observations (Fig. 3c-e) and reproduces the effects of 
supplementing the media with glutamine, glutamate and ammonium 
(Fig. 3f-h, Extended Data Fig. 6 and Supplementary Information, 
“Mathematical Model’ section). The model also accounts for the 
observed slight increase of the oscillation period by considering an 
increase in the ratio of interior to peripheral cells over time (Extended 
Data Figs 1b and 6f). Therefore, this simple model shows that periodic 
halting in biofilm growth can result from metabolic co-dependence 
between cells in the biofilm periphery and interior that is driven by 
glutamate consumption and ammonium production, respectively. 
The metabolic co-dependence between interior and peripheral cells 
gives rise to the surprising prediction that external attack could pro- 
mote growth within the biofilm. Specifically, killing of peripheral cells 
will eliminate their glutamate consumption, which will increase glu- 
tamate availability in the biofilm and thereby promote growth of 
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Figure 4 | Metabolic co-dependence between interior and peripheral cells 
gives rise to oscillations that make the colony more resilient to external 
attack. a, Visual representation of the predicted outcome of an external attack 
on biofilm growth. b, Phase contrast merged with cell death marker (cyan, 1 uM 
Sytox green) images of a wild-type (WT) biofilm region shows cell death with 
and without challenge by 2% (v/v) HzO . Scale bar, 50 um. c, In the same 
biofilm, difference images (white regions indicate cell growth) show wild-type 
growth with and without challenge by HO. d, Overexpression of glutamate 
dehydrogenase (GDH, pink) promotes more production of ammonium from 
glutamate. e, Experimental (top) and modelling results (bottom) of GDH 
overexpression (induced with 1 mM IPTG, indicated by pink shading). f, Phase 


interior cells (Fig. 4a). To test this hypothesis, we measured cell death 
and growth within oscillating biofilms (Fig. 4b, top, and Extended 
Data Fig. 7). When we exposed the biofilm to media containing 
hydrogen peroxide (H2O2), we observed increased cell death predo- 
minantly in the biofilm periphery (Fig. 4b, bottom, and Extended 
Data Fig. 8). As predicted, death of peripheral cells led to growth of 
interior cells (Fig. 4c and Extended Data Fig. 8). To verify that this 
response is not uniquely triggered by H,O2, we exposed biofilms to 
the antibiotic chloramphenicol and again observed growth of interior 
cells (Extended Data Fig. 8). These findings further support our hypo- 
thesis that glutamate consumption by peripheral cells limits its avail- 
ability in the biofilm. 


The benefit of biofilm oscillations 


Our model also assumes that glutamate starvation of the biofilm 
interior reduces the production of ammonium that can support peri- 
pheral cell growth. This assumption provokes the question as to why 
peripheral cells do not simply overcome their dependence on extra- 
cellular ammonium by increasing intracellular production””’*. To 
address this question, we constructed a strain that contains an indu- 
cible copy of the GDH gene rocG (Fig. 4d). We confirmed that GDH 
overexpression was not toxic to individual cells and did not affect their 
growth rate (Extended Data Fig. 9). In contrast, the induction of GDH 
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contrast merged with cell death marker (cyan, 1 1M Sytox green) images of a 
colony overexpressing GDH with and without challenge by HO . g, In the 
same biofilm, difference images show cell growth during GDH overexpression 
alone, and with challenge by H,O . h, Quantification of total biofilm growth 
rate in wild-type (top, n = 4 colonies) and GDH overexpression (bottom, n = 3 
colonies) strains upon challenge with H,O). Error bars represent standard 
deviations. Modelling data are shown as an inset for each strain. 

i, Co-dependence between interior and peripheral cells exhibited in a wild-type 
strain results in a growth strategy that sustains the viability of interior cells, 
while independence enforced by a GDH overexpression strain results in 
starvation of interior cells and reduced resilience to external attack. 


expression in the biofilm quenched growth oscillations (Fig. 4e and 
Extended Data Fig. 3c) and resulted in high levels of cell death in the 
colony interior (Fig. 4f, top). This result explains why peripheral cells 
do not appear to utilize the simple strategy of overcoming their 
dependence on extracellular ammonium: such a strategy would result 
in the continuous growth of peripheral cells, starving and ultimately 
causing the death of sheltered interior cells within the biofilm. 
Periodic halting of peripheral cell growth due to extracellular ammo- 
nium limitation thus promotes the overall viability of the biofilm. 
The ability of the biofilm to regenerate itself in the event of an 
external attack suggested that killing the biofilm interior first would 
be a more effective strategy for biofilm elimination. Accordingly, we 
exposed the GDH overexpression strain to hydrogen peroxide and 
again measured growth and death. As described above, GDH induc- 
tion causes death of interior cells. Exposing the GDH overexpression 
strain to hydrogen peroxide resulted in more effective global killing 
throughout the biofilm (Fig. 4f, g, bottom). While in the wild-type 
biofilm, interior cells begin to grow in response to an external attack, 
metabolic independence between interior and peripheral cells in the 
GDH strain interferes with this defence mechanism (Fig. 4h). This 
outcome is also consistent with modelling predictions (Fig. 4h, 
inset). Oscillations in biofilm growth that are driven by metabolic 
co-dependence thus promote the resilience of the biofilm community 
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by sustaining the viability of the sheltered interior cells that are most 
likely to survive in the event of an environmental stress (Fig. 4i). 


Discussion 


The data presented here reveal that intracellular metabolic activity 
within biofilms is organized in space and time, giving rise to 
co-dependence between interior and peripheral cells. Even though 
bacteria are single-celled organisms, the metabolic dynamics of indi- 
vidual cells can thus be regulated in the context of the community. 
This metabolic co-dependence can, in turn, give rise to collective 
oscillations that emerge during biofilm formation and promote the 
resilience of biofilms against chemical attack. The community-level 
oscillations also support the ability of biofilms to reach large sizes, 
while retaining a viable population of interior cells. Specifically, 
periodic halting of peripheral cell growth prevents complete star- 
vation and death of the interior cells. This overcomes the colony 
size limitation for a viable biofilm interior that would otherwise be 
imposed by nutrient consumption in the biofilm periphery. Metabolic 
co-dependence in biofilms therefore offers an elegant solution that 
resolves the social conflict between cooperation (protection) and 
competition (starvation) through oscillations. 

The discovery of biofilm oscillations presented here also raises new 
questions. While cellular processes such as swarming or expression of 
extracellular matrix components are not required for the observed 
biofilm oscillations (Extended Data Fig. 10), it will be interesting to 
pursue whether such cellular processes are influenced by oscillatory 
dynamics”. Another question worth pursuing is whether metabolic 
co-dependence can also arise in other biofilm-forming species. 
Perhaps other metabolic branches where metabolites can be shared 
among cells could also give rise to oscillations in biofilm growth. It will 
be interesting to pursue these questions in future studies to obtain a 
better understanding of biofilm development. 

Our observations also suggest future strategies to cope with the 
intriguing resilience of biofilms in the face of environmental stresses, 
such as antibiotic exposure. In particular, our findings show that 
straightforward application of stress (such as H,O, or chlorampheni- 
col) to the biofilm counterintuitively promotes growth, effectively 
rejuvenating the biofilm. Death of the colony periphery relieves the 
repression on the growth of interior cells, allowing them to regenerate 
a new biofilm periphery and interior. In contrast, manipulation of the 
metabolic co-dependence may yield a more effective approach to 
control biofilm formation. Specifically, promoting continuous growth 
of peripheral cells can starve the biofilm interior, leaving behind the 
exposed peripheral cells that can more easily be targeted by external 
killing factors. Therefore, the metabolically driven collective oscilla- 
tions in biofilm expansion described here not only reveal fundamental 
insights into the principles that govern formation of multicellular 
communities, but also suggest new strategies for manipulating the 
growth of biofilms. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 

Strains and plasmids. All experiments were done using Bacillus subtilis NCIB 
3610. The wild-type strain was a gift from W. Winkler (University of Maryland)” 
and all other strains were derived from it and verified by sequencing (see 
Supplementary Information). 

Growth conditions. The biofilms were grown using MSgg medium’®. It contains 
5mM potassium phosphate buffer (pH 7.0), 100mM MOPS buffer (pH 7.0, 
adjusted using NaOH), 2mM MgCh, 700M CaCl:, 50M MnCl, 100 1M 
FeCl;, 1 4M ZnCl, 2 uM thiamine HCl, 0.5% (v/v) glycerol and 0.5% (w/v) mono- 
sodium glutamate. The MSgg medium was made from stock solutions on the day of 
the experiment, and the stock solution for glutamate was made new each week. 
Microfluidics. We used the CellASIC ONIX Microfluidic Platform and the Y04D 
microfluidic plate (EMD Millipore). It provides unconventionally large cham- 
bers, allowing the formation of colonies containing millions of cells, yet still leaves 
room for media flow. Media flow in the microfluidic chamber was driven by a 
pneumatic pump from the CellASIC ONIX Microfluidic Platform, and the pres- 
sure from the pump was kept stable during the course of the oscillation. In most of 
the experiments, we used a pump pressure of 1 psi with only one media inlet open 
(there are 6 media inlets in the Y04D plate), which corresponds to a flow speed of 
~16 pms | in the growth chamber. 

On the day before the experiment, cells from —80°C glycerol stock were 
streaked onto LB agar plate and incubated at 37 °C overnight. The next day, a 
single colony was picked from the plate and inoculated into 3 ml of LB broth ina 
50 ml conical tube, and then incubated at 37 °C in a shaker. After 2.5 h of incuba- 
tion, the cell culture was centrifuged at a relative centrifugal force of 2,100 for 1 
min, and then the cell pellet was re-suspended in MSgg and then immediately 
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loaded into microfluidics. After the loading, cells in the microfluidic chamber 
were incubated at 37 °C for 90 min, and then the temperature was kept at 30 °C for 
the rest of the experiment. 
Time-lapse microscopy. The growth of the biofilms was recorded using phase 
contrast microscopy. The microscopes used were Olympus [X81 and IX83, and 
DeltaVision PersonalDV. To image entire biofilms, 10 X lens objectives were used 
in most of the experiments. Images were taken every 10 min. Whenever fluor- 
escence images were recorded, we used the minimum exposure time that still 
provided a good signal-to-noise ratio. 
Data analysis. ImageJ (National Institutes of Health) and MATLAB (MathWorks) 
were used for image analysis. In-house software was also developed to perform 
colony detection and quantification of colony expansion. Multiple methods of 
colony detection were used to ensure the accuracy of the analysis. To detect regions 
of expansion in a biofilm, we performed image differencing on snapshots of the 
biofilm from time-lapse microscopy videos. Specifically, we calculated the difference 
between two consecutive phase contrast images (taken 10 min apart) by finding the 
absolute difference between each pixel in each image. We then generated an image 
stack based on these results. The intensity values from the stack correlate with the 
expansion inside the biofilm. The growth area was determined by converting dif- 
ference images to binary images and then measuring the area of the colony growth 
region (white pixels). To measure cell replication time, we tracked the length and 
division of individual cells in the biofilm periphery (Extended Data Fig. 4b). 

No statistical methods were used to predetermine sample size. 


30. Irnov, |. & Winkler, W. C. A regulatory RNA required for antitermination of biofilm 
and capsular polysaccharide operons in Bacillales. Mol. Microbiol. 76, 559-575 
(2010). 
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Extended Data Figure 1 | Characterization of biofilm growth oscillations. Bottom: period of each oscillation cycle, measured peak to peak. The error bars 
a, Top: growth rate over time ofan oscillating colony. Bottom: the pressure that (+20 min) are determined by the imaging frequency (1 frame per 10 min). 
drives media flow in the microfluidic chamber is constant over time (see The period slightly increases over time (see also Extended Data Fig. 6f and 
Methods, ‘Microfluidics’ section). b, Top: growth rate of an oscillating colony. | Supplementary Information, ‘Mathematical Model’). 
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Extended Data Figure 2 | Roles of carbon and nitrogen in biofilm growth —_ Bacillus subtilis NCIB 3610 has two glutamate dehydrogenases (GDH), rocG 
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Extended Data Figure 4 | Measurements of cell growth within oscillating b, Top left: schematic of a biofilm. Top right: high magnification phase contrast 
biofilms. a, Top: visual representation of the method through which difference _ image of biofilm periphery focused at the bottom layer of cells. Bottom 
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(middle) and growth area over time (bottom) of an oscillating colony. Dashed _ panel. The periodic slowdown of the growth of individual peripheral cells is 
lines show the position of each image on the time trace. Scale bar, 100 um. responsible for the observed periodic reduction in biofilm expansion. 
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Extended Data Figure 6 | Mathematical model of biofilm growth. a, The 
model describes the dynamics of two cell populations in a biofilm, interior and 
peripheral. As the biofilm grows, there is a constant distance between the 
interior population and the biofilm edge. b-e, Bifurcation diagrams showing 
systematic analysis on the effects of external glutamine, external glutamate, 
ammonium uptake, and GDH overexpression, respectively. The red lines 
correspond to the extrema of oscillations in peripheral glutamate (stable limit 
cycle). The solid black line denotes stable fixed point. The dashed black line 


corresponds to an unstable fixed point. The vertical grey lines highlight the state 
of the system for each nutrient addition experiment shown in Fig. 3. f, Model 
prediction of oscillation period as function of interior cell fraction in the 
whole biofilm. g, h, Sensitivity analysis of oscillation period (g) and modulation 
depth (h) to changes in model parameters. Modulation depth is defined as 
the amplitude of the oscillations divided by the mean value. Grey colour 
denotes parameter regions where the system does not oscillate. 
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Extended Data Figure 7 | Temporal profile of cell death within an oscillating biofilm. Top: colony growth rate. Bottom: average fluorescence intensity of a cell 
death marker (Sytox green, 1 1M, Life Technologies) from the same colony shown in the top panel. 
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Extended Data Figure 8 | Effect of external attack with hydrogen peroxide _image differencing (see Extended Data Fig. 4a and Methods, ‘Data analysis’). 
(HO), 0.15% v/v) or chloramphenicol (CM, 5 pg ml~"). Top: cell death Scale bar, 100 j1m. The white dashed lines indicate colony edge. 
shown by Sytox green (1 j1M). Middle and bottom: colony growth shown by 
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Extended Data Figure 9 | Effect of GDH induction on cell growth. Wild- 

type and Phyperspank-RocG (uninduced or induced with 10 mM IPTG) strains 
were grown in liquid culture (MSgg medium, 30 °C). Cell generation times were 
measured using OD¢oo. Error bars show standard deviations (n = 3 replicates). 
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mutant strains. a, opp operon deletion (deficient in quorum sensing). b, comX — swarming). These results show that the corresponding genes and processes are 
deletion (deficient in quorum sensing). ¢, tapA operon deletion (extracellular _not required for biofilm oscillations. 
matrix component deletion). d, tapA operon overexpression (Phyperspank- 
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Biogenesis and structure of a type VI 
secretion membrane core complex 


Eric Durand!?*:4>*, Van Son N guyen?*, Abdelrahim Zoued!*, Laureen Logger’, Gérard Péehau-Arnaudet’*, 
Marie-Stéphanie Aschtgen!, Silvia Spinelli”, Aline Desmyter?”, Benjamin Bardiaux*®, Annick Dujeancourt*“, 
Alain Roussel”, Christian Cambillau””, Eric Cascales! & Rémi Fronzes** 


Bacteria share their ecological niches with other microbes. The bacterial type VI secretion system is one of the key 
players in microbial competition, as well as being an important virulence determinant during bacterial infections. It 
assembles a nano-crossbow-like structure in the cytoplasm of the attacker cell that propels an arrow made of a 
haemolysin co-regulated protein (Hcp) tube and a valine-glycine repeat protein G (VgrG) spike and punctures the 
prey’s cell wall. The nano-crossbow is stably anchored to the cell envelope of the attacker by a membrane core 
complex. Here we show that this complex is assembled by the sequential addition of three type VI subunits (Tss)— 
TssJ, TssM and TssL—and present a structure of the fully assembled complex at 11.6 A resolution, determined by 
negative-stain electron microscopy. With overall C; symmetry, this 1.7-megadalton complex comprises a large base 
in the cytoplasm. It extends in the periplasm via ten arches to form a double-ring structure containing the 
carboxy-terminal domain of TssM (TssM.;) and TssJ that is anchored in the outer membrane. The crystal structure of 
the TssM,,-TssJ complex coupled to whole-cell accessibility studies suggest that large conformational changes induce 


transient pore formation in the outer membrane, allowing passage of the attacking Hcp tube/VerG spike. 


In the environment, bacteria have evolved collaborative or aggressive 
mechanisms to communicate, exchange information and chemicals, 
or compete for space and resources’ *. One of the main weapons of 
bacterial conflicts is a multi-protein device called the type VI secretion 
system (T6SS) that is assembled in the attacker bacterium*. The T6SS 
is a versatile nanomachine that can deliver toxin proteins directly not 
only into prey prokaryotes but also into eukaryotic cells during bac- 
terial infections**-°. Anti-host activities have been shown to inhibit 
phagocytosis and therefore to disable macrophages, while the anti- 
bacterial activities allow the bacterium to destroy competitors and to 
have a privileged access to the niche, to nutrients or to new DNA*””. 
The T6SS is composed of 13 different proteins, encoded by genes that 
are usually clustered". It assembles a tubular puncturing device that is 
evolutionarily, structurally and functionally similar to the tail of con- 
tractile bacteriophages. Its sheath is a tubular structure, hundreds of 
nanometres long, that extends in the cytoplasm and is built by the 
polymerization of TssBC building blocks'*"™*. It is assembled on an 
assembly platform, the baseplate’*’*'’, and maintained in an 
extended, metastable conformation’*"*. The attacking arrow, 
wrapped by the sheath, comprises an inner tube that is built by stacked 
Hcp hexameric rings’ and tipped by a puncturing spike composed of 
VgrG”’. Upon contact with the prey, structural rearrangements of the 
sheath subunits induce its contraction and propulsion of the Hcp 
tube/VgrG spike towards the target cell, allowing toxin delivery'®’””". 
The phage-like T6SS tail is anchored to the attacker cell membrane by 
a trans-envelope complex”. This membrane complex not only serves 
as a docking station but has been proposed as a channel for the passage 
of the inner tube after sheath contraction, thereby preventing mem- 
brane damage in the attacker'*’”. The membrane core complex of the 
T6SS (that is, the minimal module required to function and conserved 


in all T6SS) is composed of the TssL and TssM inner-membrane pro- 
teins and the TssJ outer membrane lipoprotein’*'”””*. These proteins 
are connected through a network of interactions between TssM and 
TssL, and TssM and TssJ****”°. Although the localization and topology 
of these subunits, their interactions and the crystal structures of the 
soluble domains of TssJ and TssL have been described’”**-”’, we still 
lack crucial information on the biogenesis and overall architecture of 
this complex and how it is used as a channel during T6SS action. 


Localization, dynamics and biogenesis of the T6SS 
membrane core complex 


We first sought to determine the assembly pathway of the entero- 
aggregative Escherichia coli (EAEC) T6SS membrane core complex. 
Strains producing fluorescently labelled T6SS membrane subunits 
were engineered. The sequence encoding the super-folder green fluor- 
escent protein (sfGFP) was inserted upstream of the stop codon of the 
tssJ gene or downstream of the start codon of the tssL and tssM genes. 
In these constructs, the fusion proteins were produced from their 
native chromosomal loci. Hcp release and anti-bacterial assays 
demonstrated that the sfGFP-TssL and sfGFP-TssM fusion proteins 
were functional (Extended Data Fig. la). By contrast, strains pro- 
ducing T’ssJ-sfGFP or TssJ-mCherry had a non-functional T6SS 
(Extended Data Fig. 1b). Fluorescence microscopy analyses showed 
that sfGFP-TssL and sfGFP-TssM cluster at discrete positions at the 
cell periphery, in agreement with their membrane localization (Fig. la 
and Extended Data Fig. 1c). These foci are stable and static (Fig. la 
and Extended Data Fig. 1d). Statistical analyses further showed that 
one or two foci are observable in cells expressing the T6SS (Fig. 1b) 
and that these clusters are randomly distributed around the cell 
(Fig. 1c). Co-localization experiments with strains producing 
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Figure 1 | Biogenesis of the T6SS membrane-associated core complex. 

a, Time-lapse fluorescence microscopy recordings showing localization and 
dynamics of the sfGFP-TssM and sfGFP-TssL fusion proteins. Individual 
images were taken every 30 s. The positions of the foci are indicated by 
arrowheads. Scale bars, 1 tum. Larger fields are presented in Extended Data 
Fig. 1c. b, Statistical analysis of sfGFP-TssM and sfGFP-TssL localization. 
Shown are box-and-whisker plots of the measured number of sfGFP-TssM and 
sfGFP-TssL foci per cell for each strain with the lower and upper boundaries of 
the boxes corresponding to the 25% and 75% percentiles respectively. The black 
horizontal bar represents the median values for each strain and the whiskers 
represent the 10% and 90% percentiles. The number of cells studied per 
strain is indicated above the bars. c, Spatial repartition of the sfGFP-TssM and 
sfGFP-TssL foci. Shown is a superposition of the different foci analysed in a 
single cell. d, sfGFP-TssM and mCh-TssL proteins co-localize. Fluorescence 
microscopy recordings showing co-localization between sfGFP—TssM and 


D 


sfGFP-TssM and mCherry-tagged TssL showed that the two subunits 
are present in the foci, demonstrating that each focus corresponds to 
the position of an assembled membrane complex (Fig. 1d). To test 
whether these foci are used to anchor the phage-like tail tubular 
structure, mCherry was fused to the tssB sheath gene, at its original 
chromosomal locus in the strain producing sfGFP-TssL. Time-lapse 
recordings showed that T6SS sheathes polymerize and extend from 
the membrane complex (Fig. le). On the basis of these results, we 
conclude that membrane complexes comprising TssL and TssM (and 
probably TssJ) assemble at discrete positions in the cell and are then 
used to recruit the tail-complex subunits. Statistical analyses showed 
that the number of sfGFP-TssL or sfGFP-TssM foci per cell is higher 
than the number of sheathes (Extended Data Fig. le), suggesting that 
the membrane complexes exist in a pre-assembled form. Interestingly, 
long-term time-lapse recordings showed that these membrane com- 
plexes can be re-used for new tail polymerization events (Extended 
Data Fig. 1f). To gain further information on the biogenesis of this 
initial step, tssJ or tssM were deleted in the sfGFP-TssL-producing 
strain, and tssJ or tssL were deleted in the sfGFP-TssM-producing 
strain. The rationale behind these experiments is that if a protein 
assembled early is missing, the recruitment of late proteins will be 
affected, yielding a diffuse fluorescent signal. Figure 1f shows that 
the recruitment of sfGFP-TssM and sfGFP-TssL is affected in the 
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mCh-TssL fusion proteins. The positions of the foci are indicated by the 
arrowheads. Scale bar, 1 jm. e, The membrane complex serves as a docking site 
for tail sheath polymerization. Time-lapse fluorescence microscopy recordings 
showing co-localization between sfGFP-TssL and TssB-mCh fusion proteins. 
Individual images were taken every 30 s. Assembly/contraction of the sheath 
and TssL localization events is schematized in the bottom row of panels. Scale 
bars, 1 jum. f, Assembly pathway of the T6SS TssJLM membrane complex. 
Fluorescence microscopy recordings showing sfGFP-TssM and sfGFP-TssL 
localization in the absence of the TssJ or TssL and TssJ or TssM proteins 
respectively. The positions of the foci are indicated by the arrowheads. Scale 
bars, 1 um. The quantification of the sfGFP-TssM and sfGFP-TssL clusters per 
cell is presented in Extended Data Fig. 1g. g, Schematic representation of the 
sequential biogenesis of the T6SS membrane complex. The names of the 
proteins, their localizations and topologies are shown. 


absence of TssJ, and that of sfGFP-TssL is affected in the absence of 
TssM. Conversely, the absence of TssL had no effect on TssM recruit- 
ment (Fig. 1f and Extended Data Fig. 1g). On the basis of these results, 
we conclude that TssJ is used as a nucleation factor and that the 
biogenesis of the T6SS membrane core complex is pursued by the 
inward sequential addition of TssM and TssL (Fig. 1g). 


Architecture of the T6SS membrane core complex 


To gain further insights on the architecture of the T6SS membrane 
core complex, the tssJ, tssL and tssM genes were co-expressed in 
E. coli BL21(DE3). Constructs were designed to add StrepII, Flag 
and 6XHis tags at the carboxy (C) terminus of TssJ, amino (N) 
terminus of TssL and N terminus of TssM, respectively (Extended 
Data Fig. 2). Total membranes were isolated and solubilized using 
detergents. Two-step affinity chromatography followed by gel filtra- 
tion resulted in the purification of a complex containing TssJ, TssL 
and TssM (Fig. 2a and Extended Data Fig. 2f-h). In this complex, we 
determined the TssM-TssL stoichiometry as 1 to 1 (Extended Data 
Fig. 2h). Purified complexes were visualized by negative-stain elec- 
tron microscopy (EM) (Fig. 2b and Extended Data Fig. 3a). A data set 
was collected, and reference-free classification and averaging revealed 
characteristic views of the complex (class averages) (Fig. 2b). We 
observed rocket-shaped and ring-shaped views corresponding to side 
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Figure 2 | TssJLM complex purification and structure. a, SDS- 
polyacrylamide gel electrophoresis (SDS-PAGE) analysis of the purified 
EAEC TssJLM complex. The bands corresponding to TssM (130 kDa), TssL 
(24 kDa) and TssJ (18 kDa) after SDS-PAGE and Coomassie blue staining 
are indicated. b, Representative views (class averages) of purified TssJLM 
complexes. End to side views are shown from top left to bottom right. Scale bar, 
10 nm. ¢, Structure of the TssJLM complex. Side, cut-away, bottom and top 
views are shown from top left to bottom right respectively. The different 
regions of the complex are indicated on the cut-away view. 


and end views of the T6SS membrane core complex respectively 
(Fig. 2b). Rotational symmetry analysis of end-view class-averages 
revealed a clear five-fold symmetry in the whole TssJLM population 
(Extended Data Fig. 3b). The complex is composed of a base and a tip 
complex linked by arches (Fig. 2b). The negative-stain data set was 
used to reconstruct a 11.6-A resolution three-dimensional (3D) 
volume of the complex with five-fold symmetry applied (Fig. 2c and 
Extended Data Fig. 3c, d). Local resolution calculations using 
ResMap” indicated that the local resolution was significantly lower 
in the base (Extended Data Fig. 3e, f). This impaired a correct inter- 
pretation of this part of the TssJLM map. Since this could be due to 
flexibility between the base and the rest of the complex, we performed 
a local 3D refinement on the base region only, which yielded a 3D 
reconstruction of the base at 16.6-A resolution. A composite map 
where this new reconstruction of the base replaces the equivalent 
densities in the reconstruction of the whole complex is shown in 
Fig. 2c. The T6SS membrane core complex is 300 A in height and 
205 A in diameter (Fig. 2c). It is made of a base that is decorated at its 
bottom by five hooks and is pierced at its centre by a small hole of 15 A 
in diameter (Fig. 2c). Ten arches connect this base to a tip complex of 
160 A in diameter covered by a small cap. Remarkably, five arches 
gather at the centre of the tip complex to define a 15- to 20-A diameter 
channel. The five other arches form a scaffold at the periphery of this 
complex (Extended Data Fig. 4a). Overall, the tip complex is made of 
internal and external pillars arranged in concentric rings (Extended 
Data Fig. 4a). 
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To define how the core complex is inserted in the cell envelope, we 
first performed differential solubilization of the inner and outer mem- 
branes. The total membrane fraction was solubilized with N-lauryl 
sarkosyl, a detergent that preferentially solubilizes inner-membrane 
proteins. This differential solubilization resulted in the fractionation 
of the core complex in both inner and outer membrane fractions 
(Extended Data Fig. 4b), indicating that this complex resides in both 
membranes. To determine its orientation in the cell envelope, the puri- 
fied core complex was incubated with anti-StrepII antibodies or Ni- 
NTA-coated gold particles targeting the TssJ C-terminal StrepII and 
TssM N-terminal 6XHis tags respectively (Extended Data Fig. 4c), 
before EM analyses. Anti-StrepII antibodies labelled the tip complex/ 
cap while the base was labelled by the Ni-NTA gold particles (Extended 
Data Fig. 4c). We concluded that the TssJ C terminus is located in the 
tip complex while the TssM N terminus is located in the base (Extended 
Data Fig. 4c). When the N-terminal cysteine residue of the TssJ lipo- 
protein was substituted by Ser (C1S) to prevent its acylation, an intact 
TssJ©'S-L-M core complex was formed (Extended Data Fig. 4b), but 
differential solubilization proved the complex mis-localized to the inner 
membrane fraction only (Extended Data Fig. 4b). Hence, TssJ acylation 
tethers the apex of the complex to the outer membrane whereas the base 
of the complex is located in the cytoplasm. 

We next analysed the EM reconstruction to assign the different 
regions of the core complex to its components. The volume corres- 
ponding to one arch and the corresponding pillar within the tip com- 
plex (Extended Data Fig. 4a) is comparable in size and shape to that of 
the isolated TssM periplasmic domain (amino acids 386-1129; 
TssMp) in complex with TssJ obtained by small-angle X-ray scatter- 
ing (SAXS) (Extended Data Fig. 4d, e). Segmentation of this volume 
yielded five different sub-volumes (Fig. 3a). We propose that the sub- 
volume closest to the cap corresponds to TssJ, in agreement with its 
location close to the outer membrane. The other four sub-volumes 
would correspond to sub-domains of TssMp. Sub-volume 4 is in close 
contact with TssJ, suggesting that it corresponds to the C-terminal 
domain of TssM domain, which was previously shown to mediate 
contact with TssJ**. With sub-volume 3, it forms the tip complex while 
sub-volumes 1 and 2 correspond to the arches (Fig. 3a). Interestingly, 
the last TssM transmembrane segment crossing the inner membrane 
is located just upstream of TssMp. This would place the inner mem- 
brane at the bottom of the arches or at the top of the base. The volume 
of the base (1,450 A*) is much bigger than the estimated volume 
occupied by ten copies of the cytoplasmic domains of TssM and 
TssL (825 A*). The crystal structure of the TssL cytoplasmic domain 
dimer*®”’ could be fitted in the hooks with 88% correlation (Extended 
Data Fig. 4f). This indicates that the remainder of the base could 
correspond to the cytoplasmic domain of TssM and the 40 transmem- 
brane segments bound to detergent (Extended Data Fig. 4f). 

To gain more insight into the structure of the TssMp-TssJ complex, 
TssMp was produced and purified as described previously*’. To help 
crystallization, TssMp complex was subjected to controlled proteolytic 
digestion’’. A protease-resistant fragment of an apparent size of ~32 
kDa (called hereafter TssM32c4 residues 836-1129; Extended Data 
Fig. 5a) was further purified and co-crystallized with nb25, a specific 
camelid single-chain nanobody*’’. The structure of the TssM3¢,-nb25 
complex was solved by molecular replacement using the X-ray struc- 
ture of nb25 reported previously (Extended Data Fig. 5b and 
Extended Data Table 1). In the complex, the TssM32<; amino-acid 
chains are defined in the electron density map between residues 868 
and 1107. We therefore purified a new TssMp fragment (TssM¢c;) 
encompassing the crystallographic visible chain. This shorter fragment 
crystallized readily alone as well as in complex with the unacylated Tss] 
protein (Extended Data Table 1). The structure of TssMz¢ct is com- 
posed of two domains. The N-terminal domain (residues 870-974) is a 
bundle of four a-helices, covered on one side by a B-hairpin (Fig. 3b) 
and on the other by the C-terminal elongated stretch of the protein. The 
C-terminal domain (residues 975-1085) is a nine-stranded B-sandwich 
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that contacts nb25 or TssJ (Fig. 3b and Extended Table 2a, b). This 
C-terminal domain is followed by a stretch of residues (1086-1107) 
comprising helix «5 (Fig. 3b). T'ssJ binds to the apex of the C-terminal 
domain, and the 590-A? interaction area involves contacts between TssJ 
loops L1-2, L3-4 and L5-6 with TssMo6c; loops L3-4 and L5-6 
(Extended Data Fig. 5c and Extended Data Table 2b), in agreement 
with a previous study demonstrating the importance of T'ssJ loop L1-2 
for TssM-TssJ complex formation’’. Superimposition of the structures 
of TssM3¢;—-nb25 and TssM¢¢,;-TssJ shows that nb25 and TssJ cannot 
bind simultaneously to TssM (Extended Data Fig. 5d), explaining the 
nb25 in vivo inhibitory effect on T6SS function”. The comparison 
between TssM26c:-TssJ crystal structure and the volume proposed to 
correspond to TssJ and domains 3 and 4 of TssMp determined by EM 
resulted in 95% correlation between the two structures (Fig. 3c). This 
confirms the location of TssM2¢c,-TssJ in the tip complex (Fig. 3d). 


Cell surface accessibility and transient pore formation 


The orientation of the TssJ N terminus places the outer membrane 
above TssJ, where the cap is located (Figs 2c and 4a and Extended Data 
Fig. 5e). Accordingly, close inspection of the proposed oligomeric 
structure of the TssM36c,-T'ssJ complex could not reveal any obvious 
transmembrane region (Extended Data Fig. 5f). To test this, we engi- 
neered functional cysteine derivatives between the B-strands of the 
C-terminal domain of TssM (Extended Data Fig. 6a). The extracel- 
lular accessibility of these residues was assessed by incubating whole 
cells with an outer membrane-impermeant cysteine-reactive malei- 
mide. We observed that positions 989, 1005, 1035, 1075 and 1109 
were labelled whereas positions 972, 1019, 1062 and 1092 were not 
(Extended Data Fig. 6b and Extended Data Table 2c). With the excep- 
tion of position 1092, all other positions were labelled when cell lysates 
were used instead of intact cells (Extended Data Table 2c). The 
labelled cysteine substitutions are on the tip of TssM facing the outer 
membrane (Fig. 4a). Interestingly, residues 989 and 1005 are buried at 
the interface with TssJ (Extended Data Fig. 6c). Therefore, for these 
residues to be labelled, the TssM-TssJ complex has to dissociate. This 
result also suggests that the tip of TssMz6c; is exposed to the cell 
exterior. To test whether TssM stably crosses the outer membrane 
or accesses the cell exterior temporarily, similar experiments were 
conducted in a tssBC-deleted background. In the absence of the 
TssB and TssC sheath components, the TssJLM membrane complex 
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Figure 3 | Structure of the TssJLM tip complex. 
a, Segmentation of the TssJLM complex 
reconstruction. Each volume encompassing one 
arch and the corresponding pillar within the 

tip complex is segmented in five different domains 
(shown in different colours). b, Crystal structure of 
the TssMa6c1-TssJso1 complex represented as 
ribbons. TssJ5o) is coloured orange, while TssMo¢6ct 
is coloured cyan (4-domain) and blue (B-domain). 
The C-terminal «5-helix and the extended 

stretch are coloured magenta. The B-hairpin 
(B1-B2) is highlighted in the dashed circle. Two 
orthogonal views of the crystal structure and its 
surface representation are shown from left to 
right (coloured as in a). ¢, TssMa6c;-TssJ crystal 
structure docked into the EM volume 
corresponding to TssJ and the TssM periplasmic 
domains 3 and 4 extracted from both internal 
and external pillars of the tip complex. d, Energy- 
minimized atomic model of the tip complex 
structure (left panel, side view; right panel, 

top view). 


is properly assembled but the T6SS is inactive as no sheath assembly 
or contraction could occur. In tssBC cells, only position 1109 was 
labelled (Extended Data Fig. 6b and Fig. 4a). These results suggest 
that the TssM «5-helix crosses the outer membrane permanently, 
exposing the C-terminal extension to the extracellular medium 
whereas part of TssM 6c; domain is exposed transiently at the cell 
surface during the T6SS mechanism of action. 


Closing remarks and outlook 


The data presented here allow an unprecedented understanding of the 
biogenesis, architecture and role of the T6SS TssJLM membrane core 
complex. This complex anchors the phage tail-like structure to the cell 
envelope and is thought to serve as conduit to guide the Hcp tube/ 
VerG spike upon sheath contraction’*’’. Using fluorescence micro- 
scopy, we demonstrate that the three subunits are recruited in a spe- 
cific order, starting from the outer membrane TssJ lipoprotein and 
pursued by the sequential addition of TssM and TssL, a hierarchy in 
agreement with previously published localization and interaction 
studies'’”’?-*’. Therefore, T6SS biogenesis is initiated by an outer 
membrane lipoprotein nucleation factor and progresses inwards, like 
the assembly mechanisms of other bacterial secretion systems***’. 
Our fluorescence microscopy analyses also showed that the T6SS 
membrane core complex assembles randomly in the cell envelope, 
without specific localization. The complex is stable and can be used 
for several events of sheath assembly/contraction, increasing the 
amount of toxin effectors delivered to the target cell. 

The TssJLM complex has a five-fold symmetry and is composed of 
ten copies of each component that assemble a 1.7-MDa structure 
crossing the inner membrane, the periplasm and anchored to the 
outer membrane via the TssJ N-terminal lipid moiety. Its architecture 
is unique compared with other trans-envelope bacterial secretion 
systems (Extended Data Fig. 7a). On the basis of our accessibility 
experiments, we propose that upon assembly of other T6SS subunits 
with the membrane core complex, the TssM C-terminal extension 
(C-terminal extended stretch following helix «5 in the crystal struc- 
ture and the remaining 22 non-visible amino acids) will change its 
conformation and will cross the outer membrane. The base of the 
TssJLM complex defines a small cavity and hole that cannot accom- 
modate the VgrG protein and potential effectors bound to it (Fig. 4b, 
stages 1 and 2)*”°. We propose that the base changes its conformation 
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Stage 1 


Figure 4 | Cell surface accessibility and mechanism of action of the T6SS 
membrane core complex during secretion. a, Cell surface accessibility studies. 
Crystal structure of the TssM26cr represented as ribbons, coloured cyan 
(a-domain) and blue (B-domain). The C-terminal «5-helix and the extended 
stretch are coloured magenta. The C terminus (lacking in the crystallized 
fragment) is represented as a random structure beyond the last residue in the 
crystallographic model. The cysteine substitutions (in sphere representation) 
used for labelling experiments are positioned in the TssM2¢c; crystal structure. 
Cysteines with extracellular accessibility when the T6SS is active are coloured 
yellow, while the unlabelled ones are coloured grey. b, Model of action. The 
proposed mechanism of action involves five sequential stages. Stage 1: the 
assembled TssJLM complex is not integrally inserted in the outer membrane, 
but anchored to it by the TssJ N-terminal lipid moiety. This stage corresponds 


to receive the baseplate components. This state would correspond to a 
‘resting’ state of the T6SS machinery (Fig. 4b, stage 2). Ten arches 
cross the periplasm and are followed by ten pillars positioned in two 
concentric layers in the tip complex. The inner pillars define a channel 
of 15-20 A in diameter that is not large enough to allow the passage of 
the ~110A Hep tube’*'* (Extended Data Fig. 7b). Interestingly, it 
was previously shown that TssM undergoes large conformational 
changes during secretion*’. Therefore, we propose that the inner 
TssM pillars are pushed outwards to define a wider TssM ring with 
internal dimensions compatible with the passage of the Hcp tube/ 
VerG spike (Fig. 4b, stages 3 and 4, and Extended Data Fig. 7b, c). 
In other secretion systems, specific components are dedicated to 
assemble the outer membrane pore. No obvious transmembrane region 
could be found in the TssM C-terminal domain or in TssJ. It is unlikely 
that the C-terminal portion of TssM would form a pore of sufficient 
dimension by itself. Therefore, we propose that the stroke of the Hcp- 
VgrG arrow would mechanically push the C-terminal T'ssM domain 
towards the cell exterior, allowing the transient formation of a pore 
through the outer membrane (Fig. 4b, stage 4). To avoid deleterious 
effects for the bacterium, one may expect that the C-terminal domain of 
TssM returns to its initial ‘resting’ conformation at the periplasmic face 
of the outer membrane once the Hcp tube has been released, closing the 
outer membrane channel (Fig. 4b, stage 5). Overall, the membrane core 
complex appears to act like a docking station for the phage-like T6SS 
device. It nucleates the assembly of the rest of the secretion system and 
then guides the Hcp tube/VgrG spike through the bacterial cell envel- 
ope upon sheath contraction. Further studies will be necessary to fully 
understand the complete assembly process of the T6SS, the trigger that 
releases sheath contraction and how the Hcp tube/VegrG spike crosses 
both bacterial and host membranes. 


2 3 4 5 


to the EM reconstruction of the purified TssJLM complex (bottom inset) and 
the crystal structure of the TssM26c,-TssJ complex (top inset) presented in this 
study. Stage 2: upon assembly of T6SS baseplate and tail components, the 
C-terminal extremity of TssM inserts into the outer membrane and is therefore 
accessible at the cell surface (yellow star). This stage corresponds to the 
‘resting’ state of the T6SS membrane complex. Stages 3 and 4: the membrane 
core complex opens to allow the passage of the Hcp tube/VgrG spike or the 
sheath contraction force induces conformational changes of the TssJLM 
complex. A molecular model of a Cy symmetrized TssJ/TssMo6cy ring is 
presented (top inset). The apical loops of TssM are exposed at the cell surface 
(yellow stars). Stage 5: after release of the Hcp tube/VegrG spike, the TssJLM 
membrane complex returns to the resting state, ready to perform another 
cycle of secretion. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


No statistical methods were used to predetermine sample size. The experiments 
were not randomized. The investigators were not blinded to allocation during 
experiments and outcome assessment. 

Strains, media and chemicals. The strains, plasmids and oligonucleotides used in 
this study are listed in Supplementary Table 1. The E. coli K-12 DH5« strain was 
used for cloning steps whereas E. coli K-12 BL21(DE3) and T7-Iq pLys strains 
were used for protein purification. The enteroaggregative E. coli EAEC strain 17-2 
was used to engineer gene knockouts and fusions with fluorescent labels. Strains 
were routinely grown in lysogeny broth (LB) rich medium (or Terrific broth 
medium for protein purification) or in Sci-1-inducing medium (SIM; M9 min- 
imal medium, glycerol 0.2%, vitamin B1 1 pg ml!, casaminoacids 100 ug ml“, 
LB 10%, supplemented or not with bactoagar 1.5%)*° with shaking at 37 °C. 
Plasmids were maintained by the addition of ampicillin (100 1g ml’ for E. coli 
K-12, 200 pig ml’ for EAEC) or kanamycin (50 pg ml’). Expression of genes 
from pETG20A and pRSF vectors was induced with 1 mM of isopropyl-B-p-thio- 
galactopyrannoside (IPTG, Eurobio) for 16h. 

Strain construction. Gene deletion into the enteroaggregative E. coli 17-2 strain 
was achieved by using a modified one-step inactivation procedure"! as previously 
described’? using plasmid pKOBEG”. Briefly, a kanamycin cassette was amplified 
from plasmid pKD4*' using oligonucleotide pairs carrying 5’ 50-nucleotide 
extensions homologous to regions adjacent to the gene to be deleted. After elec- 
troporation of 600 ng of column-purified PCR product, kanamycin-resistant 
clones were selected and verified by colony-PCR. The kanamycin cassette was 
then excised using plasmid pCP20 (ref. 41). Gene deletions were confirmed by 
colony-PCR. The same procedure was used to introduce the mCherry- or sfGFP- 
coding sequences downstream from the start codon (vector pKD4-sfGFP or 
pKD4-mCherry) or the mCherry-coding sequence upstream from the stop codon 
(vector pmCherry-KD4). This procedure yields strains producing fusion proteins 
from their original chromosomal loci. 

Plasmid construction. PCRs were performed using the Phusion DNA polymer- 
ase (Thermo Scientific). Restriction enzymes were purchased from New England 
Biolabs and used according to the manufacturer’s instructions. Custom oligonu- 
cleotides were synthesized by Sigma Aldrich and are listed in Supplementary 
Table 1. Enteroaggregative E. coli 17-2 chromosomal DNA was used as a template 
for all PCRs. E. coli strain DH5a was used for cloning procedures. The pETG20A 
vector derivative encoding the periplasmic domain of the TssM periplasmic 
domain (TssMp, residues 386-1129) or the TssJ soluble domain have been prev- 
iously described”. The fragment encoding TssMa¢c; (residues 869-1107) was 
cloned into the pETG20A vector by restriction-free cloning". The pRSF-TssJ® 
intermediate plasmid was constructed by restriction cloning. Briefly, the 
sequence encoding the full-length tssJ gene (residues 1-178) was PCR-amplified 
using primers RSF-sJ-F and RSF-sJ-R. The PCR introduced a 5’ Ndel and a 3’ 
Xhol restriction site and a C-terminal streptavidin extension. The tssJ PCR prod- 
uct was sub-cloned into the pRSF-Duet (Novagen) MCS2 corresponding restric- 
tion sites. The pRSF-TssJ*-"L-"™M (encoding C-terminally StrepII-tagged TssJ, 
N-terminally Flag-tagged TssL and N-terminally 6X His-tagged TssM) was con- 
structed by restriction-free cloning’? as previously described”. Briefly, the 
sequence encoding the full-length tssL (residues 1-217) and full-length tssM 
(residues 1-1129) genes were PCR-amplified using the primer pairs RSF-fL-F/ 
RSF-fL-R and RSF-hM-F/RSF-hM-R, respectively. The two PCR products (tssL 
and tssM) were synthesized with 30-base-pair overhangs, from both 5’ and 3’ 
ends, corresponding to the designed integration sites into the pRSF-TssJ* plas- 
mid. The double-stranded product of the first PCR was then used as oligonucleo- 
tides for a second PCR using the target vector as template. The introduction of 
the C1S mutation in TssJ was performed by QuikChange mutagenesis of the 
pRSE-TssJ*-"L-"M plasmid using oligonucleotides Jcs-F and Jcs-R. Plasmid 
pIBA37-TssM was constructed by restriction-free cloning and cysteine deriva- 
tives were obtained by QuikChange mutagenesis using pIBA37-TssM-C757S 
mutant as template. 

Hep release assay. Cells producing Flag- or HA-tagged Hcp from plasmids 
pUCHcpriag or POKH cpp,” were grown in SIM to an absorbance Agoo nm ~ 0.8. 
Supernatant and cell fractions were separated as previously described.;. Briefly, 
2 X 10° cells were harvested and collected by centrifugation at 2,000g for 5 min. 
The supernatant fraction was then subjected to a second low-speed centrifugation 
and then at 16,000g for 15 min. The supernatant was filtered on sterile polyester 
membranes with a pore size of 0.2 1m (Membrex 25 PET, membraPure) before 
overnight precipitation with trichloroacetic acid 15% on ice. Cells and precipitated 
supernatants were resuspended in loading buffer and analysed by SDS-PAGE and 
immunoblotting with the anti-Flag or anti-HA antibody. As control for cell lysis, 
western blots were probed with antibodies raised against the periplasmic TolB 
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protein. The assays were performed from three independent cultures, and a repres- 
entative experiment is shown. 

Interbacterial competition assay. The antibacterial growth competition assay 
was performed as described for the studies on the Citrobacter rodentium and 
EAEC Sci-2 T6SSs*!** with modifications. The wild-type E. coli strain W3110 
bearing the Kan® pUA66-rrnB plasmid“ was used as prey in the competition 
assay. Attacker and prey cells were grown for 16 h in LB medium, then diluted in 
SIM to allow maximal expression of the sci-1 gene cluster*’. Once the culture 
reached Ago nm ~ 0.8, the cells were harvested and normalized to A¢goo0 nm = 0.5 in 
SIM. Attacker and prey cells were mixed to a 4:1 ratio and 20-1 drops of the 
mixture were spotted in triplicate onto a pre-warmed dry SIM agar plate supple- 
mented or not with anhydrotetracyclin 0.02 1g ml’. After overnight incubation 
at 37 °C, the bacterial spots were then cut off, and cells were resuspended in SIM 
to A¢oo nm = 0.5. Two hundred microlitres of serial dilutions were plated on 
kanamycin LB plates and the number of colonies was scored after overnight 
incubation at 37 °C. The assays were performed from at least three independent 
cultures, with technical triplicates and a representative technical triplicate shown. 
Fluorescence microscopy, image treatment and statistical analyses. Fluores- 
cence microscopy experiments were performed essentially as described*)**. 
Briefly, cells were grown overnight in LB medium and diluted to Agoo nm ~ 0.04 
in SIM. Exponentially growing cells (A, ~ 0.8-1) were harvested, washed in 
phosphate buffered saline buffer (PBS), resuspended in PBS to Agoo nm ~ 50, 
spotted on a thin pad of 1.5% agarose in PBS, covered with a cover slip and 
incubated for 1h at 37°C before microscopy acquisition. For each experiment, 
ten independent fields were manually defined with a motorized stage (Prior 
Scientific) and stored (x, y, z, Perfect Focus System (PFS) offset) in our custom 
automation system designed for time-lapse experiments. Fluorescence and phase 
contrast micrographs were captured every 30s using an automated and inverted 
epifluorescence microscope TE2000-E-PFS (Nikon) equipped with PFS. PFS 
automatically maintains focus so that the point of interest within a specimen is 
always kept in sharp focus at all times despite mechanical or thermal perturba- 
tions. Images were recorded with a CoolSNAP HQ 2 (Roper Scientific) and a 
X100/1.4 DLL objective. The excitation light was emitted by a 120 W metal halide 
light. All fluorescence images were acquired with a minimal exposure time to 
reduce bleaching and phototoxicity effects. The sfGFP images were recorded by 
using the ET-GFP filter set (Chroma 49002) with an exposure time of 200-400 
ms. The mCherry images were recorded by using the ET-mCherry filter set 
(Chroma 49008) using an exposure time of 100-200ms. Slight movements of 
the whole field during the time of the experiment were corrected by registering 
individual frames using StackReg and Image Stabilizer plugins for ImageJ. sfGFP 
and mCherry fluorescence channels were adjusted and merged using Image] 
(http://rsb.info.nih.gov/ij/). sfGFP fluorescence sets of data were treated to mon- 
itor foci detection. Noise and background were reduced using the ‘Subtract 
Background’ (20 pixels Rolling Ball) plugin from Fiji (Image J/National 
Institutes of Health). The sfGFP foci were automatically detected by simple image 
processing: (1) create a mask of cell surface and dilate, (2) detect the individual 
cells using the ‘Analyse Particle’ plugin of Fiji and (3) identify foci by the ‘Find 
Maxima’ process in Fiji. To avoid false positive results, each event was manually 
controlled in the original data. Microscopy analyses were performed at least six 
times, each in technical triplicate, and a representative experiment is shown. Box- 
and-whisker plots representing the number of detected foci for each strain were 
made with R software. To compare each population, t-tests were performed in R. 
Sub-pixel resolution tracking of fluorescent foci: Fluorescent foci were detected 
using a local and sub-pixel resolution maxima detection algorithm and tracked 
over time with a specifically developed plug-in for ImageJ. The x and y coordi- 
nates were obtained for each fluorescent focus on each frame. The mean square 
displacement was calculated as the distance of the foci from its location at fy at 
each time using R software and plotted over time. For each strain tested, the mean 
square displacement of at least ten individual focus trajectories was calculated. 
Inner and outer membrane separation. Cells were broken using an Emulsiflex- 
C5 (Avestin) and the crude membrane fraction was isolated by ultracentrifuga- 
tion at 100,000g for 45 min. Outer and inner membranes were separated by 
differential solubilization. Inner membranes were solubilized by 0.5% sodium 
N-lauroyl sarcosyl in 50 mM Tris-HCl pH 8.0 for 30 min at 20 °C. The insoluble 
material containing the outer membrane fraction was isolated by ultracentrifuga- 
tion at 100,000g for 20 min. The outer membrane pellet was then solubilized in 
SDS-loading buffer. The assay was performed in triplicate, from three independ- 
ent cultures and a representative experiment is shown. 

TssJLM complex production and purification. The pRSF-TssJ°-"L-"'M plasmid 
was transformed into the E. coli BL21(DE3) expression strain (Invitrogen). Cells 
were grown at 37 °C in lysogeny broth (LB) to A¢oo nm ~ 0.7 and the expression of 
the tssJLM genes was induced with 1.0 mM IPTG for 16h at 16°C. Cell pellets 
were resuspended in ice-cold 50 mM Tris-HCl pH 8.0, 50 mM NaCl, 1 mMEDTA 
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and 1mM TCEP, supplemented with 100 pg ml~* of DNase I, 100 1g ml’ of 
lysozyme and EDTA-free protease inhibitor (Roche). After sonication, MgCl, 
was added to the final concentration of 10 mM and the cell suspension was 
further broken using an Emulsiflex-C5 (Avestin). The broken cell suspension 
was clarified by centrifugation at 38,500g for 20 min. The membrane fraction 
was then collected by centrifugation at 98,000g for 45 min. Membranes were 
mechanically homogenized and solubilized in 50 mM Tris-HCl pH 8.0, 50 mM 
NaCl, 0.5% (w/v) n-dodecyl-f-b-maltopyranoside (DDM, Anatrace), 0.75% (w/v) 
decyl maltose neopentyl glycol (DM-NPG, Anatrace), 0.5% (w/v) digitonin (Sigma- 
Aldrich), 100 1M TCEP and 1 mM EDTA at 22 °C for 45 min. The suspension was 
clarified by centrifugation at 98,000g for 20 min. The supernatant was loaded onto a 
5-ml StrepTrap HP (GE Healthcare) column and then washed with 50 mM Tris- 
HCl pH 8.0, 50mM NaCl, 0.05% (w/v) DM-NPG (Affinity buffer) at 4°C. The 
TssJLM core complex was eluted in the affinity buffer supplemented with 2.5 mM 
desthiobiotin (IBA) into a 5-ml HisTrap HP (GE Healthcare) column. The column 
was then washed in the affinity buffer supplemented with 20 mM imidazole and the 
TssJLM core complex was eluted in the same buffer supplemented with 500 mM 
imidazole. Peak fractions were pooled and loaded onto a Superose 6 10/300 
column (GE Healthcare) equilibrated in 50 mM Tris-HCl pH 8.0, 50mM NaCl, 
0.025% (w/v) DM-NPG. The TssJLM core complex eluted as a single monodisperse 
peak close to the void volume of the column. The sample was used immediately for 
EM sample preparation. 

Stoichiometry analyses. The purified TssJLM core membrane complex was 
diluted to a final concentration of 0.1 mg ml‘ and denaturated at 100°C for 
10 min after the addition of 1% sodium dodecyl sulfate. The denatured sample 
was incubated in presence of 40M of Alexa Fluor 532 C5-maleimide 
(Invitrogen) and 1 mM TCEP (Pierce) for 2h at room temperature. The labelled 
proteins were separated by SDS-PAGE and protein-bound fluorescence was 
visualized and quantified using a Fujifilm FLA-3000 scanner. The assay was 
performed in triplicate, from three independent TssJLM complex preparations, 
and a representative experiment is shown. The quantification is expressed with 
the standard deviation from the three biological replicates. 

EM and image processing. Determination of the TssJLM core membrane com- 
plex structure was achieved by negative-stain EM. Nine microlitres of suitably 
diluted (~0.01mg ml~') TssJLM complex sample was spotted to glow-dis- 
charged carbon-coated copper grids (Agar Scientific). After 30s of absorption, 
the sample was blotted, washed with three drops of water and then stained with 
2% uranyl acetate. Images were recorded automatically using the EPU software 
on an FEG microscope operating at a voltage of 200 kV and a defocus range of 
0.6-25 nm, using an FE] Falcon-II detector (Gatan) at a nominal magnification of 
50,000, yielding a pixel size of 1.9A. A dose rate of 25 electrons per square 
angstrém per second, and an exposure time of 1 s, were used. A total of 72,146 
particles were automatically selected from 1,200 independent images and 
extracted within boxes of 280 pixels X 280 pixels using EMAN2/BOXER™. The 
defocus value was estimated and the contrast transfer function was corrected by 
phase flipping using EMAN2 (e2ctf). All 2D and 3D classifications and refine- 
ments were performed using RELION 1.3 (refs 48, 49). We used three rounds of 
reference-free 2D class averaging to clean up the automatically selected data set. 
Only highly populated classes displaying high-resolution features were conserved 
during this procedure and a final data set of 26,544 particles was assembled. An 
initial 3D model was generated in EMAN2 using 30 classes. Three-dimensional 
classification was then performed in Relion with five classes. The particles cor- 
responding to the most populated class (~16,738) were used for refinement. 
Relion auto-refine procedure was used to obtain a final reconstruction at 
11.56-A resolution after masking and with C; symmetry imposed. Reported 
resolutions are based on the ‘gold standard’ Fourier shell correlation (FSC) 
0.143 criterion, and FSC curves were corrected for the effects of a soft mask on 
them by using high-resolution noise substitution”®. Before visualization, all den- 
sity maps were corrected for the modulation transfer function of the detector and 
then sharpened by applying an ad hoc negative B-factor (— 1,000). Local resolu- 
tion variations were estimated using ResMap”’. 

Three-dimensional maps display and analysis. Three-dimensional reconstruc- 
tions were displayed and rendered in USCF Chimera segmented using the 
SEGGER module implemented in UCSF Chimera*. Segments corresponding 
to individual structural domains are represented in Fig. 3c-e. All other maps 
were left un-segmented. A volume/mass conversion of 0.81 Da A~? was used to 
estimate the volume occupied by TssM and TssL cytoplasmic domains. 

Protein production and purification for SAXS and X-ray analyses. The peri- 
plasmic domain of the TssM protein, TssMp (residues 386-1129), was produced 
and purified as described previously”. The purified recombinant TssMp was 
digested with trypsin (at a 1,000:1 molecular ratio) at room temperature for 
24h. The reaction was quenched by the addition of 1mM phenyl-methane- 
sulfonyl fluoride (PMSF) and the insoluble TssMp fragments were discarded 


by centrifugation at 20,000g for 30 min. A proteolysis-resistant fragment 
of apparent size ~32 kDa (called hereafter TssM32c,) was further purified by 
consecutive ion-exchange (Mono Q 5/50 GL column, GE Healthcare) and size- 
exclusion (Superdex 75 16/600 HL column) chromatographies using an Akta 
system (GE Healthcare). The purified fragment was subjected to N-terminal 
Edman sequencing. A PVDF membrane was rinsed three times with a water/ 
ethanol mixture (10/90) and inserted in the A cartridge of a Procise 494A sequen- 
cer. After five cycles of Edman degradation, the sequence DYGSL was identified 
by mass spectrometry, indicating that cleavage after Arg834 generated a 
C-terminal fragment of theoretical mass 32,398 Da, in agreement with the 32- 
kDa band observed by SDS-PAGE analyses. 

For production and purification of the TssM26cr fragment (Thr869 to 
Glu1107), E. coli BL21(DE3) cells cultivated in the TB medium carrying plasmid 
pETG20A-TssMa6cr were grown to A¢oo nm ~ 0.6 and the expression of TssMbect 
was induced by the addition of 0.5mM IPTG for 16h at 17°C. Cells were 
collected by centrifugation at 10,000g at 4°C for 15min. The cell pellet was 
resuspended in lysis buffer and lysed by sonication. The lysate was clarified by 
centrifugation at 20,000g at 4 °C for 15 min, and the supernatant containing the 
Trx-Hisg—tev-TssM 6. fusion protein was purified by consecutive Ni** -affinity 
and size-exclusion (Superdex 75 column) chromatographies on an Akta purifier 
(GE Healthcare). The fractions containing the protein of interest were pooled and 
the 6X His-tagged TEV protease was added (5% w/w). The cleaved protein was 
purified using Ni** affinity, removing the Trx-His,, followed by size-exclusion 
chromatography (Superdex 75 column) on an Akta purifier (GE Healthcare). 
Over 100 mg of TssM.¢.1 fragment were obtained per litre of culture. The purified 
protein was verified by mass spectrometry, before being concentrated up to 
8.7mg ml in 20 mM Tris-HCl pH 8.0, NaCl 150 mM. 

The production of nanobody nb25 and the formation of its complex with 
TssM32ct have been described previously*’**. The production of unacylated 
TssJ was previously described*. The TssM2z6c:-TssJ complex was obtained by 
mixing TssMz¢6c, (8.7 mg ml!) with purified TssJ (30 mg ml!) in a 1:1.2 
molecular ratio and the complex was then concentrated up to 15mg ml‘ using 
centricon (cut-off of 10,000 Da) in 20 mM Tris-HCl pH 8.0, 150 mM NaCl. 
SAXS and ab initio 3D shape reconstruction. SAXS analyses were performed at 
the ID29 beamline (European Synchrotron Radiation Facility, Grenoble, France) 
at a working energy of 12.5keV (2= 0.931 A). Thirty microlitres of protein 
solution at 1.6 mg ml! in Tris-HCl 20 mM pH 8.0, NaCl 150 mM, were loaded 
by a robotic system into a 2-mm quartz capillary mounted in a vacuum. Ten 
independent 10-s exposures were collected on a Pilatus 6M-F detector placed at a 
distance of 2.85 m for each protein concentration. Individual frames were pro- 
cessed automatically and independently at the beamline by the data collection 
software (BsxCUBE), yielding radially averaged normalized intensities as a func- 
tion of the momentum transfer q, with q = 4nsin(0)/A, where 20 is the total 
scattering angle and J is the X-ray wavelength. Data were collected in the range 
q = 0.04-6 nm’. The ten frames were combined to give the average scattering 
curve for each measurement. Data points affected by aggregation, possibly 
induced by radiation damage, were excluded. Scattering from the buffer alone 
was also measured before and after each sample analysis and the average of these 
two buffer measures was used for background subtraction using the program 
PRIMUS” from the ATSAS package”*. PRIMUS was also used to perform Guinier 
analysis® of the low q data, which provides an estimate of the radius of gyration 
(Rg). Regularized indirect transforms of the scattering data were performed with 
the program GNOM” to obtain P(r) functions of interatomic distances. The P(r) 
function has a maximum at the most probable intermolecular distance and goes 
to zero at Drax, the maximum intramolecular distance. The values of D,,.,x were 
chosen to fit with the experimental data and to have a positive P(r) function. 
Three-dimensional bead models that fitted with the scattering data were built 
with the program DAMMIF*’. Twenty independent DAMMIF runs were per- 
formed using the scattering profile of the TRX-His-TssJp and TRX-His-TssMp 
complexes, with data extending up to 0.35 nm”, using slow mode settings, 
assuming no symmetry and allowing for a maximum 500 steps to grant conver- 
gence. The models resulting from independent runs were superimposed using the 
DAMAVER suite”. This yielded an initial alignment of structures based on their 
axes of inertia followed by minimization of the normalized spatial discrepancy”. 
The normalized spatial discrepancy was therefore computed between a set of 20 
independent reconstructions, with a range of normalized spatial discrepancies 
from 0.678 to 0.815. The aligned structures were then averaged, giving an effective 
occupancy to each voxel in the model, and filtered at half-maximal occupancy to 
produce models of the appropriate volume that were used for all subsequent 
analyses. All the models were similar in terms of agreement with the experimental 
data, as measured by DAMMIF x parameter and the quality of the fit to the 
experimental curve. The DAMFILT average volume was used as the final model 
of the TRX-His-TssJ and TRX-His-TssMp complexes. 
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Ni-TNA-Nanogold labelling. The TssJLM complex was spotted onto a glow- 
discharged carbon coated grid (CF-400, Electron Microscopy Sciences). After 
1 min, excess liquid was blotted, and the grid was washed on a drop of cold 
purification buffer (50mM Tris pH 8, 50mM NaCl, 0.025% (w/v) DM-NPG) 
containing 50 mM imidazole, quickly blotted and deposited on a second drop of 
the same buffer in the presence of 5 nM Ni-TNA-Nanogold beads (Nanoprobes). 
After 2 min, the grid was rinsed sequentially for 20 s with one drop of purification 
buffer, one drop of the same buffer without detergent and three drops of 2% 
uranyl acetate. Images were collected on an FEI Tecnai F20 FEG microscope 
operating at a voltage of 200kV, equipped with a direct electron detector 
(Falcon II). Particles were selected manually using EMAN2. The assay was per- 
formed at least in triplicate, from independent TssJLM complex preparations, 
and representative particles are shown. 

Anti-Strep labelling. The TssJLM complex was mixed with monoclonal anti- 
Strep antibodies (Sigma) at a ratio of complex:antibody of 2:1. The mixture was 
incubated at 4 °C for 30 min and the labelled complex was isolated by gel filtra- 
tion. The sample was analysed by negative-stain EM as described above for 
negative-stain EM of the unlabelled TssJLM complex. The assay was performed 
at least in triplicate, from independent TssJLM complex preparations, and rep- 
resentative particles are shown. 

Crystallization and structure determination. The crystallization of the 
TssM32c;-nb25 complex has been described previously*'. For TssM2¢6c; alone, 
several kits were used for crystallization screening, including STURA, WIZARD, 
MDL, INDEX and PEGs. A hit was observed in the PEGs kit, within a well 
containing 0.2 M zinc acetate and 20% (m/v) PEG3350. Crystal optimization 
was performed by varying PEG3350 amount in the 15-25% range in 0.1M 
sodium acetate and 0.2 M ZnCl, at pH varying between 3.8 and 5.5. Crystals 
appeared after few days in 20% PEG3350, 0.1 M sodium acetate pH 4 and 0.2 M 
ZnCl,. Crystals were tested at the European Synchrotron Radiation Facility 
(ESRF) ID23-1 beamline after cryo-cooling in the crystallization liquor supple- 
mented with 12.5% propylene glycol. 

The TssM26c1-TssJ complex was screened for crystallization using the PEGs 
and PACT] kits. Hits were observed in PACT]. All contained Zn”*: 0.01 M zinc 
chloride, 0.1 M sodium acetate pH 5, 20% (w/v) PEG 6000; or 0.01 M zinc chlor- 
ide, 0.1 M MES pH 6, 20% (w/v) PEG 6000. Crystal optimization was performed 
by using PEG6000 in the 10-20% range in 0.1 M sodium acetate/MES pH 4.75-6, 
and ZnCl, at 0, 0.01, 0.05 and 0.2 M. No crystals were obtained in conditions 
without ZnCl, or containing 0.2 M ZnCl). By contrast, well-shaped crystals 
appeared in 50mM ZnCl, 15% PEG6000 and 0.1 sodium acetate pH 4.75. 
Crystals were cryo-cooled with polypropylene glycol 12.5% but diffracted to only 
~4.0 A at the Soleil Proxima 2 beamline (Saint Aubin, France). Further crystals 
were obtained in LIMBRO plates. Large crystals were obtained by mixing 6 pl of 
protein and 2 11] of well solution in 50 mM ZnCl, 15% PEG6000, 94 mM sodium 
acetate pH 4.75 and 6mM MES pH 6. Crystals were dipped in polypropylene 
glycol for ~20s and exposed at the ESRF ID23-1 beamline. 

Data collection was performed at 100 K at the Soleil Proxima 1 beamline (Saint 
Aubin, France) for TssM32¢,-nb25 and at the ID23-1 beamline (ESRF synchro- 
tron, Grenoble, France) for TssM26c; alone and for the TssMz6c:-TssJ complex. 
Data were processed by the XDS® package and scaled with XSCALE (Extended 
Data Table 1). 

The structure of the TssM3.¢,-nb25 complex was determined by molecular 
replacement with Molrep® using the previously determined structure of nb25 
(Protein Data Bank accession number 4QGY)”. After refining the positions of 
the two nb25 molecules in the asymmetric unit by rigid body refinement with 
AutoBuster”, an electron density map was calculated at 1.92-A resolution. 
Features such as a-helices were easily identified, making it possible to trace 
manually the model of TssM32¢ using COOT®, alternated with cycles of refine- 
ment with AutoBuster®, with non-crystallographic symmetry restraints, and 
translation, rotation and screw-rotation (TLS) group refinement, features 
used in all refinement procedures described below. The final structure at 
1.92-A resolution had Rwork/Riree Values of 18.4/21.0%, 96.3% of the residues 
in the preferred area of the Ramachandran plot and no outliers (Extended Data 
Table 1). 

The structure of TssMa6ct alone was solved by molecular replacement with 
Molrep*' using the refined model of TssM32¢; from the TssM32¢,-nb25 complex. 
The initial structure model was improved through iterative refinement with 
AutoBuster® and manual refitting with COOT®. The final structure at 1.51-A 
resolution had Ryork/Rfree values of 19.2/20.2%, 97.4% of the residues in 
the preferred area of the Ramachandran plot and no outliers (Extended 
Data Table 1). 

The structure of the TssM26c:-TssJ complex was solved by molecular replace- 
ment with Molrep® using the refined structure of TssMy¢. and the previously 
determined TssJ structure (Protein Data Bank 3RX9)*° in which all the other 
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conformations were removed. A first round of rigid body refinement and four 
cycles of Phenix® cartesian-simulated annealing were performed. The resulting 
model was improved through iterative refinement with AutoBuster® and manual 
refitting with COOT®. The final structure at 2.24-A resolution had Ryork/ Rice 
values of 20.0/22.3%, 96.9% of the residues in the preferred area of the 
Ramachandran plot and four outlier residues in very poorly defined loops 
(Extended Data Table 1). The TssM32c,-nb25, TssMagcy and TssM26c;-TssJ 
structures form similar homodimers in the asymmetric unit. However, as 
reported by PISA®, and the known topologies of TssM and Tss]”*”, these dimers 
are not biologically relevant. Molecular contacts were analysed by the PISA 
server® and figures were prepared with Chimera™ and Pymol®”. 

The crystal structures of the TssM3¢,-nb25 complex, and of the TssMa6ct 
fragment and TssM2¢6c:-TssJ complexes, have been deposited in the Protein 
Data Bank under accession numbers 4Y7M, 4Y7L and 4Y70 respectively. 
Docking TssM2¢-TssJ structure and TssL.y;,. The crystal structures of the 
TssMo¢ct-rTssy Complex and of the TssL cytoplasmic domain (Protein Data Bank 
3U66)** were docked automatically using Chimera” after map segmentation. 
Refinement of docked TssM26c,-J pentamer in the EM density map. The 
atomic model of the docked TssMa¢cr-J structure was refined in the EM density 
with RSRef®. First, missing side-chain and polar hydrogen atoms were added 
with Modeller”. The structure was minimized using 2,000 steps of least-squares 
conjugate gradient refinement in the presence of distance restraints for hydrogen 
bonds and backbone dihedral angle restraints to maintain secondary structures. 
The minimization was performed with the real-space objective function calcu- 
lated by RSRef in CNS”. The C; symmetry was enforced by strict non-crystal- 
lographic symmetry restraints. The total energy included internal parameters 
(bond length, bond angle, improper and dihedral angles) and non-bonded inter- 
actions with full Van der Waals and electrostatic potentials using a 7.5 A cutoff. 
The final correlation coefficient between the EM reconstruction and the refined 
TssMa6cr-J] atomic model was 0.929 (as calculated by RSRef), whereas it was 0.706 
before minimization. 

Modelling of TssM26cr-J decamer. The atomic position of a TssMa6cr-J proto- 
mer from the outer ring of the pentamer served as the starting structure to 
generate a TssM6c1-J decamer model with cyclic ten-fold symmetry using 
CNS”. The symmetry was enforced by strict non-crystallographic symmetry 
restraints (rotations of 36° around the symmetry axis). First, 5,000 steps of rigid 
body minimization were performed including only inter-protomer energetic 
contributions (full Van der Waals and electrostatic potentials). After a short 
all-atom minimization (300 steps), 1.5 ps of molecular dynamics simulation at 
1,000 K was performed, followed by 300 steps of minimization and 10 ps of 
molecular dynamics simulation at 200 K. Minimizations and molecular dynamics 
simulations were realized with both intra-protomer and inter-protomer energetic 
contributions activated, and the backbone conformation of the protomer was 
restrained with harmonic constraints. 

Substituted cysteine accessibility method. Cysteine accessibility experiments 
were performed on whole cells, mainly as described’*” with modifications. A 
20-ml culture of wild-type or AtssBC strains producing a periplasmic cysteine- 
less TssM (Cys727-to-Ser) or derivatives bearing cysteine substitutions were 
induced for tssM gene expression with 0.05 1g ml‘ anhydrotetracyclin (AHT) 
for 1h. Cells were harvested and resuspended in buffer A (100 mM Hepes (pH 
7.5), 150 mM NaCl, 25 mM MgCl,) to a final Agoo nm Of 12 in 500 pl of buffer A. 
Bovine serum albumin (BSA)-coupled maleimide (Sigma-Aldrich) was added to 
a final concentration of 100 [1M (from a 20 mM stock freshly dissolved in DMSO) 
and the cells were incubated for 30 min at 25 °C. B-Mercaptoethanol (20 mM final 
concentration) was added to quench the biotinylation reaction, then cells were 
washed twice in buffer A and resuspended in buffer A containing N-ethyl mal- 
eimide (final concentration 5mM) to block all free sulfhydryl residues. After 
incubation for 20 min at 25 °C, cells were disrupted by sonication. Membranes 
recovered by ultracentrifugation at 100,000g for 40 min were resuspended in 
Laemmli buffer before SDS-PAGE analysis and immunodetection with anti- 
Flag antibodies (to detect the TssM proteins). Controls were performed by label- 
ling total membranes from the same samples instead of whole cells. The assay was 
performed in triplicate, from three independent cultures, and a representative 
experiment is shown. 

SDS-PAGE, protein transfer, immunostaining and antibodies. SDS-PAGE 
was performed on Bio-Rad Mini-PROTEAN systems using standard protocols. 
For immunostaining, proteins were transferred onto 0.2-11m nitrocellulose mem- 
branes (Amersham Protran). Immunoblots were probed with primary antibodies 
and goat secondary antibodies coupled to alkaline phosphatase, and developed in 
alkaline buffer in presence of 5-bromo-4-chloro-3-indolylphosphate and nitro- 
blue tetrazolium. The anti-TolB was from our laboratory collection, whereas 
the anti-HA (3F10 clone, Roche), anti-Flag (M2 clone, Sigma Aldrich), anti- 
StrepII (Sigma Aldrich), anti-5his (Sigma Aldrich) monoclonal antibodies and 
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alkaline-phosphatase-conjugated goat anti-rabbit or mouse secondary antibodies 
(Millipore) were purchased as indicated. 


55. 


40. 


Al. 


Brunet, Y. R., Bernard, C. S., Gavioli, M., Lloubés, R. & Cascales, E. An epigenetic 
switch involving overlapping fur and DNA methylation optimizes expression of a 
type VI secretion gene cluster. PLoS Genet. 7, e€1002205 (2011). 

Datsenko, K. A. & Wanner, B. L. One-step inactivation of chromosomal genes in 
Escherichia coli K-12 using PCR products. Proc. Natl Acad. Sci. USA 97, 6640-6645 
(2000). 


. Chaveroche, M. K., Ghigo, J. M. & d’Enfert, C. A rapid method for efficient gene 


replacement in the filamentous fungus Aspergillus nidulans. Nucleic Acids Res. 28, 
e97 (2000). 


. van den Ent, F. & Lowe, J. RF cloning: a restriction-free method for inserting target 


genes into plasmids. J. Biochem. Biophys. Methods 67, 67-74 (2006). 


. Gueguen, E. & Cascales, E. Promoter swapping unveils the role of the Citrobacter 


rodentium CTS1 type VI secretion system in interbacterial competition. Appl. 
Environ. Microbiol. 79, 32-38 (2013). 


. Zaslaver, A. et al. A comprehensive library of fluorescent transcriptional reporters 


for Escherichia coli. Nature Methods 3, 623-628 (2006). 


. Zoued, A. etal. TssK is a trimeric cytoplasmic protein interacting with components 


of both phage-like and membrane anchoring complexes of the type VI secretion 
system. J. Biol. Chem. 288, 27031-27041 (2013). 


. Tang, G. et al. EMAN2: an extensible image processing suite for electron 


microscopy. J. Struct. Biol. 157, 38-46 (2007). 


. Scheres, S. H. RELION: implementation of a Bayesian approach to cryo-EM 


structure determination. J. Struct. Biol. 180, 519-530 (2012). 
Scheres, S. H. Semi-automated selection of cryo-EM particles in RELION-1.3. 
J. Struct. Biol. 189, 114-122 (2015). 


. Chen, S. et al. High-resolution noise substitution to measure overfitting and 


validate resolution in 3D structure determination by single particle electron 
cryomicroscopy. Ultramicroscopy 135, 24-35 (2013). 


. Kucukelbir, A., Sigworth, F. J. & Tagare, H. D. Quantifying the local resolution of 


cryo-EM density maps. Nature Methods 11, 63-65 (2014). 


. Pettersen, E. F. et al. UCSF Chimera — a visualization system for exploratory 


research and analysis. J. Comput. Chem. 25, 1605-1612 (2004). 


. Konarev, P. V., Volkov, V. V., Sokolova, A. V., Koch, M. H. & Svergun, D. 1. PRIMUS: a 


Windows PC-based system for small-angle scattering data analysis. J. Appl. 
Crystallogr. 36, 1277-1282 (2003). 


. Konarev, P. V., Petoukhov, M. V., Volkov, V. V.& Svergun, D. 1. ATSAS 2.1, a program 


package for small-angle scattering data analysis. J. Appl. Crystallogr. 39, 277-286 
(2006). 

Guinier, A. La diffraction des rayons X aux trés petits angles; application a l’étude 
de phénoménes ultramicroscopiques. Ann. Phys. (Paris) 12, 161-237 (1939). 


56. 


57. 


74. 


75. 


. Winn, M. D., 


. The PyMOL 
. Chapman, M. S., Trzynka, A. & Chapman, B. K. Atomic modeling of cryo-electron 


. Brunger,A.T. Version 1.2 of the Crystallography and N 


. Goemaere, E. L., Devert, A., Lloubés, R. & Cascales, E. 


Svergun, D. |. Determination of the regularization parameter in indirect-transform 
methods using perceptual criteria. J. Appl. Crystallogr. 25, 495-503 (1992). 
Franke, D. & Svergun, D. |. DAMMIF, a program for rapid ab-initio shape 
determination in small-angle scattering. J. Appl. Crystallogr. 42, 342-346 (2009). 


. Volkov, V. V. & Svergun, D. |. Uniqueness of ab initio shape determination in small- 


angle scattering. J. Appl. Crystallogr. 36, 8360-864 (2003). 


. Kozin, M. B. & Svergun, D. |. Automated matching of high- and low-resolution 


structural models. J. Appl. Crystallogr. 34, 33-41 (2001). 


. Kabsch, W. XDS. Acta Crystallogr. D 66, 125-132 (2010). 
. Vagin, A. & Teplyakov, A. Molecular replacement with MOLREP. Acta Crystallogr. D 


66, 22-25 (2010). 


. Blanc, E. et a/. Refinement of severely incomplete structures with maximum 


likelihood in BUSTER-TNT. Acta Crystallogr. D 60, 2210-2221 (2004). 


. Emsley, P., Lohkamp, B., Scott, W. G. & Cowtan, K. Features and development of 


Coot. Acta Crystallogr. D 66, 486-501 (2010). 
urshudov, G. N. & Papiz, M. Z. Macromolecular TLS refinement in 
REFMAC at moderate resolutions. Methods Enzymol. 374, 300-321 (2003). 


. Adams, P. D. et a/. PHENIX: a comprehensive Python-based system for 


macromolecular structure solution. Acta Crystallogr. D 66, 213-221 (2010). 


. Krissinel, E. & Henrick, K. Inference of macromolecular assemblies from crystalline 


state. J. Mol. Biol. 372, 774-797 (2007). 
olecular Graphics System. v.1.5.0.4 (Schrédinger, LLC, 2014). 


microscopy reconstructions — joint refinement of model and imaging parameters. 
J. Struct. Biol. 182, 10-21 (2013). 


. Sali, A.& Blundell, T. L. Comparative protein modelling by satisfaction of spatial 


restraints. J. Mol. Biol. 234, 779-815 (1993). 
Rsystem. Nature Protocols 
2, 2728-2733 (2007). 


. Bogdanov, M., Zhang, W., Xie, J. & Dowhan, W. Transmembrane protein topology 


mapping by the substituted cysteine accessibility method (SCAM™): application 
0 lipid-specific membrane protein topogenesis. Methods 36, 148-171 (2005). 
ovements of the ToIR 
C-terminal domain depend on TolQR ionizable key residues and regulate activity 
of the Tol complex. J. Biol. Chem. 282, 17749-17757 (2007). 


. Du, D. et al. Structure of the AcrAB-TolC multidrug efflux pump. Nature 509, 


512-515 (2014). 

Hodgkinson, J. L. et al. Three-dimensional reconstruction of the Shigella T3SS 
ransmembrane regions reveals 12-fold symmetry and novel features throughout. 
Nature Struct. Mol. Biol. 16, 477-485 (2009). 

Low, H. H. et al. Structure of a type IV secretion system. Nature 508, 550-553 
(2014). 


©2015 Macmillan Publishers Limited. All rights reserved 


~ 
N a 
| | 


a 
L 


W3110 kan? survival 
a (log cfu) 


AtssM ggrpTSSM AtSSL sige SSL 


e 
a a a 
6-7 
WT | AtssM ‘ageeTSSM’' AtSSL ' gqee TSSL 
WT AtssJ TsSusigep TsSJUmch 
gO 8 © .6 6 & «< s 
oe -TolB 
30 @ 
_— — — Hep 
16@ 
Cc 
DIC 
fluo 
merge 


. 


ARTICLE 


Mean square displacement [a.u] 


— sicrp SSM 


— siarp SSL 


@O 


time 
f (min) 


DIC 
TSSBinch 
stare SSL 
merge 


schematic 
diagram 


g 


number of TssB,,c, sheathes 


25 


2.0 


05 


0.0 


©2015 Macmillan Publishers Limited. All rights reserved 


T T T T T 
0 1 2 3 4 


number of .qrpTssL cluster / cell 


n=42 


n=60 n=48 


AtssJ — AtssL 


starp SSM 


AtssJ  AtssM 


stare [SSL 


ARTICLE 


Extended Data Figure 1 | Functional and dynamic properties of 
fluorescently labelled Tss proteins. a, GFP-TssM and GFP-TssL fusion 
proteins are functional. Top: Hcp release assay. Hcp release was assessed by 
separating whole cells (C) and supernatant (S) fractions from the indicated 
strains. A total of 1 X 10” cells and the TCA-precipitated material from the 
supernatant of 2 X 10° cells were analysed by western blot using anti-Flag 
monoclonal antibody (lower panel) and anti-TolB polyclonal antibodies as a 
lysis control (upper panel). The molecular mass markers (in kilodaltons) are 
indicated on the left. Bottom: anti-bacterial assay. The anti-bacterial activity 
was assessed by mixing kanamycin-resistant prey E. coli K-12 cells with the 
indicated attacker cells for 16 h at 37 °C in SIM. The number of recovered E. coli 
prey cells is indicated in the graph (as log of colony-forming units (c.f.u.)). The 
circles indicate values from three independent assays, and the average is 
indicated by the bar. b, TssJ-sfGFP and TssJ-mCh fusion proteins are non- 
functional. Hcp release was assessed by separating whole cells (C) and 
supernatant (S) fractions from the indicated strains. A total of 1 x 10° cells and 
the TCA-precipitated material from the supernatant of 2 X 10” cells were 
analysed by western blot using anti-Flag monoclonal antibody (lower panel) 
and anti-TolB polyclonal antibodies as a lysis control (upper panel). The 
molecular mass markers (in kilodaltons) are indicated on the left. c, sfGFP- 
TssM and sfGFP-TssL cluster in foci. Large fields of fluorescence microscopy 
recordings showing localization of the sfGFP-TssL (left) and sfGFP-TssM 
(right) fusion proteins. The positions of selected foci are indicated by 


arrowheads. Scale bars, 5 um. d, sfGFP-TssM and sfGFP-TssL foci are stable 
and static. Mean square displacement (in arbitrary units (a.u.)) of sfGFP-TssM 
(blue line) and sfGFP-TssL (red line) clusters were measured by sub-pixel 
tracking of fluorescent foci and plotted over time (in minutes). e, The TssBC 
sheath tubular structures assemble on TssJLM membrane complexes. Statistical 
analyses reporting the average number of sheath per cell compared with the 
number of membrane complexes per cell, highlighting the observation that the 
number of membrane complexes is at least equal to the number of sheathes. 
Lower and upper boundaries of the boxes correspond to the 25% and 75% 
percentiles respectively. Black bold horizontal bar, median values for each 
strain; whiskers, 10% and 90% percentiles; n indicates the number of cells 
studied per strain. f, Long-term fluorescence microscopy recordings. Time- 
lapse fluorescence microscopy recordings showing localization and dynamics 
of the sfGFP-TssL and TssB-mCherry fusion proteins. Individual images were 
taken every 15 min. Assembly/contraction of the sheath and TssL localization 
events are schematized in the lowest panel. Scale bars, 1 jum. g, Statistical 
analysis of sfGFP-TssM and sfGFP-TssL localization in various tss 
backgrounds. Shown are box-and-whisker plots of the measured number of 
sfGFP-TssM and sfGFP-TssL foci per cell for each indicated strain with the 
lower and upper boundaries of the boxes corresponding to the 25% and 75% 
percentiles respectively (horizontal bar, the median values for each strain; 
whiskers, the 10% and 90% percentiles); n indicates the number of cells studied 
per strain. 
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Extended Data Figure 2 | Expression and purification of the T6SS 
membrane core complex. a-e, T6SS operon genomic organization and 
constructs used for in vitro analyses. a, Schematic representation of the T6SS 
sci-1 gene cluster from entero aggregative E. coli. The numbers on top refer 
to the gene locus tag (EC042_XXXX). Genes encoding core components 
(identified by their names on bottom, for example, ‘B’ refers to the tssB gene) are 
coloured grey. Genes of unknown function are coloured white. The three genes 
used to reconstitute the core membrane complex are coloured orange (tss/), 
blue (tssL) and green (tssM). b, Schematic representation of the engineered 
constructs: the tssJ, tssL and tssM genes were amplified with an additional Shine 
Dalgarno (SD) sequence and 3’ StrepII, 5’ Flag and 5’ 6X His tags respectively. 
These three fragments were cloned into the pRSF-Duet vector (c). This 
construct allows the production of the C-terminally StrepII-tagged TssJ 

outer membrane (OM) lipoprotein and N-terminally Flag-tagged TssL and 
6X His-tagged TssM inner-membrane (IM) proteins (d, e). The proteins 

are schematized and their boundaries and principal characteristics (TM, 
transmembrane segments; SP, signal peptide; CYS, acylated cysteine) are 


indicated (d) and their topologies are shown (e). The additional TssM 
constructs (TssMp, TssM32c; and TssM26cy) used for SAXS or X-ray analyses 
are shown at the bottom. f-h, Purification and biochemical characterization 
of the T6SS membrane core complex. f, Analytical size-exclusion chromato- 
graphy analysis of the purified TssJLM complex (continuous line) on a 
Superose 6 column, calibrated with 75-, 158-, 440- and 660-kDa molecular 
mass markers (dotted lines). The molecular mass of each marker (in 
kilodaltons) is indicated on the top of the corresponding peak. An arrow 
indicates the position of the peak fraction corresponding to the TssJLM 
complex. g, SDS-PAGE of the purified TssJLM complex analysed by 
Coomassie staining (CB) or immunoblotting using anti-His («-His), -Flag 
(a-Flag) and -StrepII («-STREP) antibodies. h, Left: cysteine labelling of the 
purified TssJLM complex in reducing and denaturing conditions as described 
in Methods. The total number of cysteine residues was nine for TssM, five 
for TssL and none for TssJ (the N-terminal cysteine is acylated). Right: the 
relative amount of TssL compared with TssM (densitometry relative to 

the number of free cysteine residues, fixed at 1 for TssM). 
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Extended Data Figure 3 | Architecture of the T6SS membrane core 
complex. a, Negative-stain EM of the EAEC TssJLM complex. Representative 
micrograph of the data set used for image processing. Isolated TssJLM 
complexes were clearly visible (white circles). b, Plot of the rotational 
autocorrelation function for a representative class average of an end-view. 

c, FSC curve of the TssJLM reconstruction. The ‘gold standard’ FSC curve 
was calculated in Relion using the masked reconstruction of the TssJLM 
complex. The resolution at 0.143 correlation was 11.56 A. d, Top: side and 
corresponding cut-away views of the 3D reconstruction for the whole 


TssJLM complex. Bottom: local resolution as calculated by Resmap. The 
TssJLM volume (left reconstruction, side view; right reconstruction, cut-away 
view) is coloured according to the local resolution from high resolution 

(~12 A) in blue to low resolution (>30 A) in red. e, FSC curve of the TssJLM 
base. The ‘gold standard’ FSC curve was calculated in Relion using the 
unmasked reconstruction of the TssJLM base. The resolution at 0.143 
correlation was 16.6 A. f, Top, side and bottom views of the 3D reconstruction 
after specific refinement of the base. 
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Extended Data Figure 4 | Structural analysis and segmentation. 

a, Segmentation of the TssJLM reconstruction. Left: above the base, ten 
equivalent densities could be defined by segmentation. They are arranged in 
two concentric rings. The internal ring is represented in green in the top 
panel and the external ring is represented in blue in the bottom panel. Right: 
cut-out views of the complex showing the arrangement of the two concentric 
rings at different levels (grey lines) along the periplasmic portion of the 
TssJLM complex. The cut-out views are seen from the bottom of the complex. 
b, Requirement of TssJ lipidation for complex assembly and insertion into the 
outer membrane. Left: membrane fractionation by differential solubilization 
followed by immunoblot analysis. Total membrane extracts from cells 
producing the wild-type TssJLM complex or the TssJLM complex with an 
unacylated variant of TssJ (Cys1-to-Ser substitution, CS) were solubilized 

by lauroyl sarcosine to separate inner membranes and outer membranes. 

His SSM, piagTssL and TssJstrep (indicated on the right) were revealed by anti- 
His, anti-Flag and anti-StrepII antibodies respectively. Controls included 
immuno-detection of the inner membrane DglA diacylglycrol lipase and the 
outer membrane OmpF porin. Wild-type TssJLM complex co-fractionates with 
both the inner and outer membrane fractions whereas the Cys1-to-Ser 
substitution mutated complex co-fractionates only with the inner-membrane 
fraction. Top right: negative-stain EM of the mutated TssJcsLM complex. 
Representative micrograph of the data set used for image processing. Isolated 
TssJcsLM complexes were clearly visible (white circles). Bottom right: gallery of 
representative class averages generated after reference-free 2D classification 
in Relion. End to side views are shown from top left to bottom right. 

c, Orientation of the TssJLM complex in the cell envelope. Left: schematic 
representation of the TssJ (J, orange), TssL (L, blue) and TssM (M, green) 
proteins. Their localization, main characteristics (lipidation or transmembrane 
segments shown in black) and the location of the 6 His and StrepII tags 


(red balls) are indicated. The strepII and 6X His tags were introduced at the C 
terminus and N terminus of TssJ and TssM respectively. Middle: immune and 
Nanogold labelling coupled to EM. Anti-StrepII or Nanogold-NTA were 
incubated with the TssJLM complex and visualized by negative-stain EM. A 
gallery of representative views is presented (top row, StrepII labelling; 
bottom row, Ni-NTA labelling). StrepII antibodies (a schematic diagram with 
StrepII antibodies depicted as blue circles is shown on top) and Nanogold-NTA 
are highlighted in red circles. Right: the positions of the StrepII antibody 
(targeting TssJ C terminus) and of the Ni-NTA gold particle (targeting TssM 
N terminus) are indicated on the TssJLM reconstruction. d, SAXS data and 
low-resolution structure of the TssM,-Tss] complex. Top left: experimental 
scattering data (green crosses) and the fitting curve (continuous red line) 
calculated from an ab initio model of the TssM,-Tss] complex. Top right: 
Guinier plot (dots) with the linear fit (continuous line). Bottom left: distance 
distribution function of the TssM,-TssJ complex. Bottom right: SAXS envelope 
(grey surface) of the “best representative’ model of the TssM,-TssJ complex. 
Each view is rotated by 90° around the y-axis. e, Location of the TssMp-TssJ 
complex SAXS envelope in the 3D reconstruction of TssJLM complex. Left: 
the volume of the TssM,-Tss] complex determined by SAXS was docked 
into the EM 3D reconstruction of the TssJLM complex (top). Two optimal 
docking positions were found, both with 82% correlation with the EM map 
(coloured magenta and pink). The corresponding volumes in the EM map were 
extracted (bottom). They correspond to the same volume displayed in 
Extended Data Fig. 4a. Right: direct comparison of the SAXS (magenta) and 
EM (blue and green) volumes corresponding to the TssM,—TssJ complex. The 
volumes are equivalent in size and shape. f, TssL cytoplasmic domain docking 
into the TssJLM complex base. Fitting of the TssL cytoplasmic domain 
(TssL,yto)”* dimer in green ribbons in the hooks found in the base. Top and 
bottom: side and bottom views, respectively. 
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Extended Data Figure 5 | Crystal structure of the TssJ-TssM C-terminal 
domain complex. a, Amino-acid sequence of TssM. The different domains 
as well as the fragments used in this study are indicated (yellow, transmem- 
brane helix; grey, cytoplasmic domain; green, blue and purple, periplasmic 
domain; blue and purple, C-terminal domain corresponding to the TssM32ct 
fragment; purple, C-terminal domain corresponding to the TssMo6c 
fragment). b, Crystal structure of the TssM32c;-nb25 complex. The two 
proteins are represented as rainbow-coloured ribbons. The complementary 
determining regions (CDRs 1-3, coloured blue, green and red, respectively) of 
nb25 are indicated. The inset highlights the TssM32c;-nb25 interface: the 
TssM32c; surface is coloured beige whereas nb25 is represented as rainbow- 
coloured ribbons; the side chains of the amino acids in contact with TssM32¢ 
are indicated. The nb25 nanobody binds the TssM C-terminal domain, and 
covers a surface area of 580 A? by inserting its protruding CDR3 between 
TssM32c; loops L5-6 and L9-10. The contacts between the two proteins are 
listed in Extended Data Table 2a. c, Crystal structure of the TssMz¢c,-TssJ 
complex. Left: the two proteins are represented as ribbons and coloured in 
rainbow mode. Middle: same view rotated by 90°. The TssJ loop 1-2, previously 
shown to contact TssM”, is indicated. Right: TssM26c,-TssJ interface. Top 
panel: the TssM36c; surface is coloured violet, whereas TssJ is represented as 
rainbow-coloured ribbons. The TssJ side-chains of the amino acids in contact 
with TssM are indicated. The loops are numbered according to the flanking 


B-strands. Bottom panel: the TssJ surface is coloured beige whereas TssM 6c; is 
represented as rainbow-coloured ribbons. The TssM side-chains of the amino 
acids in contact with TssJ are indicated. The contacts between the two 
proteins are listed in Extended Data Table 2b. d, Comparison of the binding 
sites of nb25 and TssJ on TssM. Left: the structure of the TssM26c,-Tss]J 
complex (rainbow coloured) has been superimposed to the structure of the 
TssM32c;-nb25 complex (only nb25 is shown in grey for clarity). Right: the 
same partners as in the left panel in surface representation. TssM¢6c; (violet), 
TssJ (green) and nb25 (pink). e, Insertion of the TssJ lipid anchor in the outer 
membrane. Left: TssJ structure”’ with the N-terminal 24 residues (absent in 
the crystal structure). This N-terminal extension (in magenta), predicted to be 
disordered, was modelled in Chimera using Modeller. The first cysteine residue 
is acylated to allow anchorage to the inner leaflet of the outer membrane 
(orange rectangle). Right: docking of the TssM6c,-TssJ complex in the EM 3D 
reconstruction of the TssJLM complex (only the uppermost (tip) part of the 
TssJLM complex is shown). Left panel: two TssM36c,-TssJ were docked into the 
inner and outer pillars of the tip complex. Right panel: docking in each 

pillar of the TssJLM tip complex (C; symmetry). f, Hydrophobicity of the 
TssMa6c:-T'ss] complex. Surface representation of the TssMz6¢,-TssJ decamer 
(left, top view; right, side view). The hydrophobicity of the surface residues is 
displayed (blue to red scale from most hydrophilic to most hydrophobic). 

No obvious hydrophobic patch is visible at the surface of the complex. 
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Extended Data Figure 6 | Cell surface accessibility of TssM C-terminal 
domain. a, Functionality of the TssM cysteine variants. Hcp release was 
assessed by separating whole cells (C) and supernatant (S) fractions from the 
wild-type (WT) 17-2 strain and its AtssM derivative producing a wild-type 
allele of TssM or TssM cysteine substitution derivatives (as indicated). A total 
of 1 X 10° cells and the TCA-precipitated material from the supernatant of 
2X 10° cells were analysed by western blot using anti-HA monoclonal 
antibody (lower panel) and anti-TolB polyclonal antibodies as a lysis control 
(upper panel). The molecular mass markers (in kilodaltons) are indicated on 
the left. b, Cysteine substitution labelling. Accessibility to cysteine residues 
positioned in TssM domain 4 loops was assessed by treating isolated 
membranes (M) or whole cells (WC) of the indicated strain (WT, wild-type 
17-2; AtssBC; AtssJ) producing the indicated TssM cysteine derivative (in red 
letters) with the cysteine-reactive, membrane-impermeant BSA-maleimide 


(BSA-mal). Samples corresponding to a total of 5 X 10” cells were analysed 
by western blot using anti-Flag monoclonal antibody. The position of the 
TssM protein (~125 kDa) is indicated as well as that of a retarded band 
corresponding to BSA-maleimide-coupled TssM (~190 kDa; asterisk). The 
molecular mass markers (in kilodaltons) are indicated. c, Close-up of the 
TssMo6c:-TssJ interface. TssMy6c; is represented in blue ribbons. TssJ is 
represented in orange ribbons and orange transparent surface. TssM residues 
accessible from the cell exterior when the T6SS is functional are indicated by 
yellow spheres whereas unaccessible residues are shown by grey spheres. 

The accessible residues 989 and 1005 are buried at the interface between TssM 
and TssJ, suggesting that this interface is probably disrupted during T6SS 
assembly and/or function. Left and right panels are orthogonal views of the 
same molecule. 
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Extended Data Figure 7 | Comparison with other bacterial secretion 
systems and model for channel opening. a, Comparison between the T6SS 
TssJLM membrane core complex structure and other bacterial secretion 
systems. From left to right, the E. coli AcrAB-TolC multi-drug efflux pump 
(EMDB accession number emd-5915)’*, the EAEC T6SS membrane core 
complex (this study, EMDB accession number emd-2927), the Shigella T3SS 
transmembrane complex (EMDB accession number emd-1617)” and the 

E. coli R388 T4SS complex (EMDB accession number emd-2567)”°. The 
positions on the inner membrane (IM) and outer membrane (OM) are 
indicated (C, cytoplasm; P, periplasm). Scale bar, 10 nm. b, Docking of the Hcp 
tube/VgrG spike into the TssJLM 3D reconstruction. Left: before sheath 
contraction. The Hcp tube/VgrG spike (VgrG in yellow and Hep in green; 
surface representation) was manually docked in the 3D reconstruction of 
TssJLM complex (grey surface). The diameter of the channel defined by the 
closed tip complex is not large enough to allow the passage of the tube/spike, 


suggesting that large conformational changes probably occur. The cavity at the 
tip of VgrG could be filled by VgrG-bound PAAR modules or toxin effectors”. 
Right: during sheath contraction. The diameter of the C;o-symmetrized 
TssMa6ct model (represented as ribbons) is compatible with the passage of 
the Hcp tube/VgrG spike (same colours as in the left panel). ¢, Closed and open 
forms of the TssM2¢c; oligomer. Crystal structure of TssM6c; represented 

as ribbons and transparent surface. The TssMa6cr o- and B-domains are 
coloured cyan and blue, respectively. The C-terminal «5-helix and the extended 
stretch are coloured pink. Cysteines with extracellular accessibility when the 
TOSS is active are coloured yellow, while the unlabelled ones are coloured 
red. Left: docking of the TssM26c:-TssJ crystal structure in the EM 3D 
reconstruction of the TssJLM tip complex. Top and bottom panels, side and top 
views, respectively. Right: model of a C,9-symmetrized oligomer of the 
TssMa6ct domain. Top and bottom panels, side and top views, respectively. 
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Extended Data Table 1 | Data collection and refinement statistics 


TssM39c,-nb25 TssMo6ct TssMo6c;-TssJ 

Data collection 
Space group P64 P 4,2;2 P 4,2,2 
Cell dimensions 

a, b, c (A) 95.2, 95.2, 172.95 64.0, 64.0, 249.7 85.5, 85.5, 256.4 

a, B, y (°) 90.0, 90.0, 120.0 90.0, 90.0, 90 90.0, 90.0, 90 
Resolution (A) 50.0-1.92(1.97-1.92)* 30.0-1.51(1.6-1.51)* 50.0-2.24(2.38-2.24)* 
Roicics 0.079 (1.08) 0.067 (0.59) 0.067 (0.73) 
T/ol 18.0 (2.0) 19.2 (3.0) 21.5 (3.1) 
Completeness (%) 100.0 (100.0) 99.9 (89.3) 99.7 (98.4) 
Redundancy 11.4 (11.3) 9.9 (9.9) 10 (10) 
Refinement 
Resolution (A) 47.6-1.92(1.97-1.92)* 22.1-1.51 (1.55-1.51)* 49 .3-2.24(2.3-2.24)* 
No. reflections 67543 (4721) 82127 (5359) 46047 (3015) 
Rywork Riree 0.184/0.21(0.234/0.25) 0.192/0.202(0.241/27.6) — 0.208/0.228(0.224/0.25) 
No. atoms 

Protein 5522 3784 5521 

Ligand/ion 15 22 4 

Water 805 536 379 
B-factors 

Protein 42.3 27.4 51.7 

Ligand/ion 98.8 55.0 81 

Water 49.9 39.5 63.8 
R.m.s deviations 

Bond lengths (A) 0.009 0.010 0.010 

Bond angles (°) 1.05 1.03 1.19 


§Each data set has been collected on a unique crystal. 
*Highest resolution shell is shown in parenthesis. 
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Extended Table 2a. Interactions between TssM and nb25 
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Extended Table 2c. TssM cysteine accessibility. 


n25/CDR3 TssMbpect 
distance | bond 
number type | atom | number | type | atom (A) Cysteine Labelled Labelled Position WAS # 
H position at rest in action 
103 Gly |_o 1063 | Ala| N 2.89 
104 _| me | ca _| 1061 | Gly| o | 3.08 we - ° Nene ee |, ster? 
H - : 
105 | Tyr | N | 1061 | Gly] o | 285 ore a loop pe- eee 
H : 
107 _| Thr | oG1 | 1060 | Pro | o | 267 ei = er Pash? een 
107 Thr | CG2 | 1061 | Gly| cA |_ 3.56 aaa . ° deep Be? eres 
1035 - + loop B7- 8 | 113/113 
109 Tyr | CE1 | 1067 | Ser | CB 3.50 
H 1062 - - loop B11- a5 94/94 
109 Tyr | OH | 1080 | Tyr| 0 3.47 
H 1075 - + loop B10- B11 51/51 
109 Tyr | OH | 1081 | Thr| OG1 | 2.64 
1092 - z helix a5 | 0/0 
110 Ile | CD1 | 1062 | Val| CG2 | 3.73 
1109 ++ ++ C-terminus | NA* 
113 Pro | _O 984 | Gly| CA | 3.12 
H 
113 Pro | _O 985__| Thr | N 3.00 
114 Tyr | CE 982__| Arg | CD 3.55 
114 Tyr | CZ 982__| Arg | NE 3.63 
H 
114 Tyr | OH | 1010 | Trp | NEI | 2.99 
H 
114 Tyr | 0 982__| Arg | NE 2.88 
H 
115 Gl oO 982 | Arg | NH2 | _2.76 
116 Met | O 1008_| Pro |_ CD 3.32 
H 
117 Asp | ODI | 982__| Arg | NH2 | 2.80 
TssJ TssM 
Loop L3,4 
Loop L1,2 


The letter H in right-hand columns indicates that atoms establish a hydrogen bond. 


WAS, water-accessible surface of the original amino acids (measured in the unbound TssM/in the TssM-TssJ complex). 


*Not visible in the electron density map. 
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G-protein-coupled receptors (GPCRs) signal primarily through G proteins or arrestins. Arrestin binding to GPCRs blocks 


G protein interaction and redirects signalling to numerous G 


-protein-independent pathways. Here we report the crystal 


structure of a constitutively active form of human rhodopsin bound to a pre-activated form of the mouse visual arrestin, 
determined by serial femtosecond X-ray laser crystallography. Together with extensive biochemical and mutagenesis 
data, the structure reveals an overall architecture of the rhodopsin-arrestin assembly in which rhodopsin uses distinct 
structural elements, including transmembrane helix 7 and helix 8, to recruit arrestin. Correspondingly, arrestin adopts 
the pre-activated conformation, with a~20° rotation between the amino and carboxy domains, which opens up a cleft in 
arrestin to accommodate a short helix formed by the second intracellular loop of rhodopsin. This structure provides a 
basis for understanding GPCR-mediated arrestin-biased signalling and demonstrates the power of X-ray lasers for 


advancing the frontiers of structural biology. 


G-protein-coupled receptors (GPCRs) comprise the largest family of 
cell surface receptors, which signal primarily via G proteins or 
arrestins'*. Upon activation, GPCRs recruit heterotrimeric G proteins 
and subsequently G-protein-coupled receptor kinases (GRKs), which 
phosphorylate GPCRs to allow the high-affinity binding to arrestin’. 
Arrestin binding to the receptors blocks their interactions with 
G proteins and leads to the receptor’s desensitization*. The binding 
of arrestins to GPCRs also initiates numerous cellular signalling path- 
ways that are independent of G proteins. Arrestin-mediated signalling 
is therefore a central component of the GPCR functional network. 
GPCRs are targets of one-third of the current clinically used 
drugs. Recent studies have demonstrated that G-protein and arrestin 
pathways are distinct and can be pharmacologically modulated inde- 


pendently using biased GPCR ligands’. Biased GPCR ligands are 
often preferred over unbiased agonists and antagonists, as they selec- 
tively direct the receptor to a subset of partners and can deliver 
therapeutic benefits with fewer undesirable side effects. Research 
towards biased ligands has become a new trend for GPCR-targeting 
therapeutics’. 

The molecular mechanisms of GPCR signalling have been unravelled 
by recent breakthroughs in GPCR structural biology’"°. In the antagon- 
ist-bound state, GPCRs assume a closed conformation with the cyto- 
plasmic ends of the transmembrane (TM) helices packed closely with 
each other”, thus blocking the interactions with G proteins or arrestins. 
In contrast, agonist binding promotes conformational changes in 
GPCRs, including a dramatic movement within the cytoplasmic side 
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of the TM domain*"’, thus allowing activated receptors to recruit 
G proteins or arrestins to mediate downstream signalling. However, 
arrestin coupling to GPCRs may require a conformation of the receptor 
different from that required for coupling with G proteins'*””. 
Rhodopsin is a prototypical GPCR responsible for light percep- 
tion’. Along with the B,-adrenergic receptor (B2AR), rhodopsin has 
served as a model system for studying GPCR signalling’®. Figure la 
shows rhodopsin binding to G protein and arrestin. Light induces 
isomerization of 11-cis retinal to all-trans-retinal (ATR), which acti- 
vates rhodopsin and promotes its interactions with G protein’””’. Light- 
activated rhodopsin is then phosphorylated by rhodopsin kinase 
(GRK1), leading to high affinity recruitment of arrestin that terminates 
the G protein signalling. Activation of rhodopsin can also be achieved 
through mutations, including the E113°7°Q/M257°°Y mutation, 
which yields a constitutively active rhodopsin’ (superscripts in resi- 
dues refer to the Ballesteros-Weinstein numbering”). The crystal 
structure of bovine rhodopsin has been determined in the inactive, 
resting state’, the ligand-free state*’**, and the ligand-activated state 
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Figure 1 | Rhodopsin-arrestin interactions and complex assembly. 

a, Diagram of the binding of rhodopsin (Rho) with G protein and arrestin as 
described in the main text. Labels are 11-cis retinal (ECR), all-trans-retinal 
(ATR), and rhodopsin kinase (GRK1). b, Rhodopsin (Rho) and arrestin 
(Arr) interaction determined by pull-down assay in the absence and presence of 
ATR (top panel). Middle panel, rhodopsin loading controls. Bottom panel, 
relative binding of *°S-labelled arrestin was determined by densitometry (n = 3, 
error bars, s.d.). c, Binding of Hiss-MBP-rhodopsin (E1 13°8Q/M257°"Y) 
protein to biotin-MBP-arrestin (wild type (WT) and 3A) measured by 
AlphaScreen in the absence or presence of 5 uM ATR. The first six columns are 
controls (luminescence signals in the presence of only one of the binding 
partners; n = 3, error bars, s.d.). d, Competition of arrestin binding to 
rhodopsin was determined by a homologous AlphaScreen assay and the ICs 
value was derived from repeat experiments ( = 3, error bars, s.d.). e, Negative 
stain electron microscopy images of rhodopsin-arrestin complexes without or 
with T4L at the N terminus; right panel, overlay of the electron microscopy 
image with the structures of T4L, rhodopsin and arrestin. m, detergent micelle. 
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in complex with a G-protein peptide’*. Arrestin structures have been 
determined in the inactive**’* and pre-activated form’’. Recent elec- 
tron microscopy analysis has revealed the assembly and conforma- 
tional dynamics of the B,AR-f-arrestin complex”’. Here we report 
the crystal structure of an active form of human rhodopsin bound to 
a pre-activated mouse visual arrestin, determined by serial femtosecond 
crystallography (SFX). The structure has been confirmed by electron 
microscopy, double electron-electron resonance (DEER) spectroscopy, 
hydrogen-deuterium exchange mass spectrometry (HDX), cell-based 
rhodopsin-arrestin interaction assays, and site-specific disulfide cross- 
linking experiments. Our study provides a molecular basis for under- 
standing GPCR-mediated arrestin-biased signalling. 


Characterization and crystallization 


To characterize the rhodopsin-arrestin interaction, we expressed and 
purified E113*7°Q and E113°78Q/ M257°*°Y mutant receptors 
(Extended Data Fig. 1a). These mutations were introduced in the 
context of the N2N"™C/N282°°'?C mutant that is known to create 
a disulfide bond that increases rhodopsin stability without affecting its 
activity” *°. The N2N*"™C/N282°°'°C mutant is referred to as our 
wild-type control. To determine the interaction between rhodopsin 
and arrestin, we developed a bead binding pull-down assay. In 
this assay, rhodopsin expressed as a fusion with a maltose-binding 
protein (MBP) at its N terminus was bound to amylose beads, which 
were then used to pull down in vitro translated arrestin labelled 
with *°S. Wild-type arrestin has weak background binding to wild- 
type rhodopsin (Fig. 1b). The E11 3°?8Q mutation increased wild-type 
arrestin binding by twofold to threefold, and the E113°*°Q/ 
M257°*°Y mutation further increased the binding of wild-type 
arrestin in the presence of all-trans-retinal (fourfold to eightfold). 
In contrast to the relatively weak binding of wild-type arrestin, the 
binding of 3A arrestin, a pre-activated form of arrestin that obviates 
the need for receptor phosphorylation for high affinity binding 
through three alanine mutations in L374, V375, and F376 in 
the C-terminal tail of arrestin, is much stronger. In the absence of 
all-trans-retinal, we observed a nearly 30-fold increase of 3A arrestin 
binding to the E113°7°Q/M257°*°Y receptor. All-trans-retinal 
further increased 3A arrestin binding to the E113°*°Q/M257°°Y 
receptor by ~60-fold above the binding of wild-type arrestin to 
wild-type rhodopsin (Fig. 1b and Extended Data Fig. 1b). 

We also measured rhodopsin-arrestin interactions using 
AlphaScreen assays (Extended Data Fig. 1c) with Hisg-tagged 
rhodopsin and biotin-tagged arrestin. Wild-type arrestin interacted 
weakly with the E113°*°Q/M257°*°Y rhodopsin, regardless of the 
presence of all-trans-retinal (Fig. 1c). As a positive control, the 
GaCT-HA peptide, a high affinity peptide variant of the C 
terminus of G-transducin (G,)*, readily interacted with the 
E1137 Q/M257°*°Y receptor in the absence of all-trans-retinal, 
and addition of all-trans-retinal slightly increased this interaction 
(Extended Data Fig. 1d). Quantitative competition using unlabelled 
3A arrestin or GxCT-HA with the E113°*°Q/M257°*°Y receptors 
revealed an ICs) value of 15nM and 700nM for the binding of 
3A arrestin and the GaCT-HA peptide, respectively (Fig. 1d and 
Extended Data Fig. 1d). The strength of the interaction between the 
E113°78Q/M257°°Y rhodopsin and the 3A arrestin is in a similar 
range as the estimated Ky value of 30-80nM for the binding of 
arrestin to the fully activated phosphorylated rhodopsin*’. 

Mixing individually purified proteins did not yield a stable 1:1 
complex, nor did it lead to crystallization. Extensive biochemical 
data support a 1:1 stoichiometry in the rhodopsin-arrestin 
complex***’. Therefore, we engineered a fusion protein in which 
3Aarrestin is linked by a 15-residue linker to the C terminus of 
E113°78 Q/M257°“°Y rhodopsin. We expressed and purified the rho- 
dopsin-arrestin fusion protein, as well as a T4 lysozyme (T4L)-rho- 
dopsin-arrestin fusion, in which a T4L is fused to the N terminus of 
rhodopsin to increase the soluble surface for crystallization (Extended 
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Data Fig. 2a). The T4L-rhodopsin-arrestin fusion protein is mono- 
meric and relatively stable with a T,, of 59 °C (Extended Data Fig. 2b, 
c). Negative stain electron microscopy images revealed that E113°7°Q/ 
M257°*°Y rhodopsin and 3A arrestin form a stable complex with 
arrestin bound to the cytoplasmic side of rhodopsin (Fig. le). The 
T4L-rhodopsin-arrestin fusion protein formed crystals with sizes in 
the range of 5 to 15 um under various lipid cubic phase (LCP) crystal- 
lization conditions (Extended Data Fig. 2d, e). Despite extensive optim- 
ization, the crystals diffracted only to 6-8 A at synchrotron sources 
(Extended Data Fig. 2f). We thus turned our attention to the emerging 
method of SFX* with an LCP injector (LCP-SFX)**"*. 


Structure determination by SFX 


Because of the small size of crystals, we hypothesized that diffraction 
by X-ray free electron laser (XFEL) at the Linac Coherent Light Source 
(LCLS) would improve data quality given the advantages of intense 
and very short XFEL pulses for micrometre-size crystals. In the LCP- 
SFX method, a stream of gel-like LCP with fully hydrated microcrys- 
tals runs continuously in vacuum across the 1.5-m-diameter XFEL 
beam, which delivers 120 X-ray pulses per second with less than 50 fs 
pulse duration and sufficient intensity to capture crystal diffraction 
patterns with a single pulse. Within ~12h of run time, we collected 
over 5 million detector frames, of which 22,262 had more than 40 
diffraction spots as determined by the Cheetah hit-finding software”. 


view view 


Figure 2 | The structure of the rhodopsin-arrestin complex. a, The structure 
of the rhodopsin-arrestin complex in four orientations. The relative 
dimensions of rhodopsin and arrestin are shown in the intracellular view. 
TM1-TM7 indicates rhodopsin transmembrane helices 1-7; H8 is intracellular 
helix 8. b, An overall view of the rhodopsin-arrestin complex shown with 
transparent solid surface. T4 lysozyme (T4L) is omitted from this view. 

c, Crystal packing diagram of the rhodopsin-arrestin complex with T4L as 
yellow ribbon model. 
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Diffraction patterns from 18,874 crystals could be indexed and inte- 
grated using CrystFEL”*. The data were processed according to the 
apparent tetragonal lattice with a large unit cell (a = b = 109.2 A and 
c= 452.6 A). The diffraction was anisotropic with resolution limits of 
3.8 A and 3.3 A along the a*/b* and c* axes, respectively (Supple- 
mentary Table 1). 

The crystals appeared to be pseudo-merohedrally twinned in 
P2,2,2, (Supplementary Table 2) and the structure was solved by 
molecular replacement using known structures of active rhodopsin” 
and pre-activated arrestin’ (details in Methods). The structure con- 
tains four rhodopsins (residues 1-326), four arrestins (residues 
12-361 with a small missing loop of residues 340-342), and three 
TALs (residues 2-161 in complexes A and D; residues 2-12 and 
58-162 in complex C; no T4L was modelled in complex B owing to 
poor density) (Fig. 2). The final structure was refined to Rwork and 
Rfree Of 25.2% and 29.3%, respectively, with excellent geometry 
(Supplementary Table 1b). The overall arrangement of the T4L-rho- 
dopsin-arrestin complex is well supported by the electron density 
maps (Extended Data Fig. 3), including a 3,000 K simulated annealing 
omit map. Because of the twinned nature of the data sets, we per- 
formed extensive structure-validation experiments, including DEER, 
HDX, cell-based rhodopsin-arrestin interaction assays and site-spe- 
cific disulfide cross-linking. Below we describe the rhodopsin- 
arrestin structure and the results of validation experiments. 


Overall structure of the rhodopsin-arrestin complex 


The most striking feature of the rhodopsin-arrestin complex is the 
asymmetric binding of arrestin to rhodopsin (Fig. 2) and this asym- 
metric arrangement is similar in all four complexes in the asymmetric 
unit, providing an independent confirmation of the rhodopsin- 
arrestin complex assembly (Extended Data Fig. 4). Figure 2a shows 
one rhodopsin-arrestin complex in four 90° orientations. From the 
intracellular (IC) view, rhodopsin and arrestin have similar heights, 
but the width of arrestin is nearly three times that of rhodopsin. 
Figure 2b shows the rhodopsin-arrestin complex in a transparent 
surface, whose overall arrangement of the domains can be fit into 
the electron microscopy images (Fig. le). Figure 2c shows the layered 
or type I packing of the complex in the crystal lattice with alternating 
hydrophilic and hydrophobic layers comprising arrestin, T4L and 
rhodopsin, respectively (Fig. 2c). This arrangement allows the com- 
plex to form extensive packing interactions that involve all soluble 
portions of the proteins, with the arrestin being the central mediator 
for packing with T4L, rhodopsin and arrestin from neighbouring 
symmetry-related molecules. 

To validate the assembly of the rhodopsin-arrestin complex, we 
used DEER to determine intermolecular distances within the com- 
plex*’. The DEER distances from residue Y74**" of rhodopsin to three 
arrestin residues (T61, V140, and S241) measured in a non-fused 
rhodopsin-arrestin complex were 28 A, 23 A and 33 A, closely match- 
ing the distances of 28 A, 22 A and 34 A, respectively, as observed in 
the crystal structure (Fig. 3). The intramolecular distances in the 
active arrestin bound to light-activated phosphorylated rhodopsin 
have also been studied extensively by DEER”, and all of them match 
exceedingly well with the crystal structure (Supplementary Table 3). 
Together, these data support the conclusion that the complex formed 
by fusion proteins closely resembles the physiologically relevant com- 
plex formed by individual proteins. 


The rhodopsin-arrestin interface 

The four rhodopsin-arrestin complexes in the asymmetric unit adopt 
nearly identical interfaces (Extended Data Fig. 4a), which are stabi- 
lized by intermolecular interactions as summarized in Supplementary 
Table 4. The total surface area buried in the interface is 1,350 A*, 
which is substantially smaller than the area (2,576 A’) buried in the 
B,AR-G, complex®. Unlike the continuous interface observed in the 
B.AR-G, complex, the rhodopsin-arrestin complex has four distinct 
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Figure 3 | DEER validation of rhodopsin-arrestin complex assembly. a, An 
overall view of rhodopsin-arrestin assembly showing the three intermolecular 
distances based on the models of the R1 nitroxide pairs at rhodopsin residue 
y74""! and three arrestin residues T61, V140, and $241 based on the crystal 
structure. b-d, The experimental distance distributions between the nitroxide 
spin labelled R1 pairs of rhodopsin Y74*™" and bovine arrestin $60, V139, 
and L240, which are in equivalent positions to mouse arrestin T61(b), V140 (c), 
and S$241(d) as labelled in the figure. 


arrestin interface patches (Fig. 4a, b and Extended Data Fig. 4b). 
The first arrestin interface patch consists of the finger loop (residues 
Q70 to L78), which adopts a short o-helix and forms extensive inter- 
actions with the C terminus of TM7 and the N terminus of helix 8, as 
well as the loop residues (ICL1) of rhodopsin (Fig. 5a). Interactions of 
arrestin with TM7 of rhodopsin are of particular interest because 
conformational changes in TM7 have been implicated in arrestin- 
biased signalling’*’*. Moreover, the close interactions between 
rhodopsin’s helix 8 and arrestin have been shown to be essential for 
high-affinity binding of arrestin to the activated rhodopsin*. The 
second arrestin interface patch is formed by the middle loop (residue 
V140 region) and the C-loop (residue Y251 region at the central loop 
in the arrestin C domain) that interact with the ICL2 of rhodopsin, 
and the arrestin back loop (R319 and T320) that interacts with the 
C terminus of TM5. The middle and C-loops are close to each other in 
the inactive arrestin, but move apart upon its activation to form a cleft 
that accommodates the ICL2 of rhodopsin, which adopts a short helix 
(Fig. 4a, b). The positions of the finger loop and the C-loop are 
supported by a composite omit 2F,—F, electron density map 
(Extended Data Fig. 3a, d). The third arrestin interface patch is the 
B-strand (residues 79-86), which follows the finger loop and interacts 
with residues from TM5, TM6 and ICL3 of rhodopsin. The fourth 
arrestin putative interface patch is mostly between its N-terminal 
B-strand (residues 11-19) and the C-terminal tail of rhodopsin, which 
was not visible in the electron density map owing to the apparent 
flexibility of this region, but was computationally modelled based 
on HDX and disulfide cross-linking data described below 
(Extended Data Fig. 5). Consistent with the crystal structure, these 
arrestin elements have been implicated in various aspects of arrestin 
activation and receptor binding?™. 

To further characterize the rhodopsin-arrestin interfaces, we per- 
formed three additional sets of validation experiments. The first 
was HDX, which probes the dynamics and stability of protein 


564 | NATURE | VOL 523 | 30 JULY 2015 


TMS 


B strand 
interface 


Figure 4 | The rhodopsin-arrestin interface and its validation by HDX. 

a, b, Two overall views showing the four interface patches of the rhodopsin-— 
arrestin complex. c-e, Mapping of HDX on the rhodopsin-bound arrestin 
structure. Rhodopsin is coloured in red and arrestin is coloured based on the 
exchange rate differences between free 3A arrestin and rhodopsin-bound 
arrestin as shown in Extended Data Fig. 6a. This figure was made using a 
computational model of the full rhodopsin-arrestin complex. 


complexes**. Compared with free arrestin, the rhodopsin-bound 
arrestin has several regions that are protected from exchange, includ- 
ing the finger loop and the N-terminal B-sheets, consistent with their 
location in the rhodopsin-binding interface (Fig. 4c, d and Extended 
Data Fig. 6a). The hydrogen to deuterium exchange rate of arrestin in 
the complex is lower than that for free arrestin across the whole 
protein, indicating that arrestin is stabilized by complex formation, 
consistent with the results of previous HDX experiments“ and ther- 
mal stability assays, which revealed that the melting temperature of 
free arrestin is six degrees lower than that of the complex (53 °C versus 
59 °C, Extended Data Fig. 6b). 

The second set consisted of Tango assays”, which have been used 
for probing GPCR-arrestin interactions (Extended Data Fig. 7a). 
Wild-type rhodopsin and wild-type arrestin had a very low basal 
interaction and all-trans-retinal increased the binding by approxi- 
mately threefold. In contrast, E113°7°Q/M257°°Y rhodopsin showed 
a high level of interaction with the pre-activated 3A arrestin, and 
addition of all-trans-retinal further increased the binding signal by 
approximately fivefold. Mutations in finger loop (D74, M76, G77, 
and L78), middle loop (Q134 and D139), and C-loop (L250 and 
Y251) decreased rhodopsin-arrestin binding (Extended Data 
Fig. 7b). Correspondingly, mutations in rhodopsin residues involved 
in arrestin binding also weakened the interaction (Extended Data 
Fig. 7c), consistent with the complex crystal structure. 

The third set consisted of site-specific disulfide cross-linking 
experiments, which have been used to validate structures based on 
the geometry requirements for disulfide bond formation (Ca-Ca 
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Figure 5 | Validation of the rhodopsin-arrestin interface by disulfide bond 
cross-linking. a-e, Structure and cross-linking of arrestin with rhodopsin. 
Panels are arrestin finger loop with rhodopsin TM7 and helix 8 (a); arrestin 
middle loop with rhodopsin ICL2 (b); arrestin C-loop residue Y251 with 
rhodopsin TMS (c); arrestin B-strand interface residues with residues 

of rhodopsin TM5, ICL3, and TM6 (d); and arrestin’s N terminus with 
rhodopsin’s C-tail (e). Rhodopsin K311 is marked with a red asterisk and the 
side chain of arrestin M76 is shown in full from computation modelling of the 
full rhodopsin-arrestin complex, which was also used in panel e. Black 
asterisks, arrestin; arrowheads, rhodopsin/arrestin crosslinking adduct. 


distances of 5-9A and appropriate side-chain orientations). We engi- 
neered cysteine pairs at the binding interface of arrestin and rhodop- 
sin, which were tagged with Flag and HA, respectively. Over 314 
co-expression combinations were tested and monitored by SDS- 
PAGE followed by western blotting (Extended Data Fig. 8). The 
results are summarized in Supplementary Table 5. Every interface 
residue in arrestin was included in the study and the results closely 
agree with the crystal structure. For example, the distances from the 
Co atom of the finger loop residue G77 of arrestin to the Ca atoms of 
N3107°”, K311°*8 and Q312° in rhodopsin fit the requirement for 
disulfide bond formation (Fig. 5a). G77C cross-linked efficiently with 
N310”°’C and Q312°°C, but not with K311°*8 C because the CB of 
K311°** points away from G77 (Fig. 5a). Neither did G77C show 
cross-linking with a large set of other rhodopsin residues, indicating 
the high specificity of the cross-linking experiments (Extended Data 
Fig. 8c and Supplementary Table 5). In contrast, several other mutants 
in the finger loop region (D74C, M76C, and L78C) readily cross- 
linked with Q312°°C from helix 8 (Fig. 5a). The cross-linking results 
of these four finger loop residues not only matched the crystal struc- 
ture, but also agreed well with the results from the Tango assays 
(Extended Data Fig. 7b). In addition, mutants of three N-terminal 
finger loop residues (Q70C, E71C, and D72C) were cross-linked to 
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mutants in rhodopsin ICL1 T70'"'C and K67'"'C, respectively 
(Extended Data Fig. 7d). 

We also observed cross-linking of the arrestin middle loop (D139) 
with rhodopsin ICL2 (G149'"7) (Fig. 5b), of the arrestin C-loop 
(Y251) with rhodopsin TMS5 (T229°* and A233°°*) (Fig. 5c), and 
of the arrestin B-strand (residues 79-86) that follows the finger loop 
with rhodopsin TM5, TM6, and ICL3 (Fig. 5d). Additional cross- 
linking was observed in two back-loop residues R319C and T320C 
of arrestin with Q237'“'°C of TMS in rhodopsin (Extended Data Fig. 
8e). Furthermore, extensive cross-linking of the arrestin N terminus 
with the C-terminal tail of rhodopsin was detected, including R19 of 
arrestin with $334‘"™ of rhodopsin (Fig. 5e), K16 of arrestin with 
$338C"™ and K339“*™ of rhodopsin, and V11 and S10 of arrestin 
with the final eight residues of rhodopsin (Supplementary Table 5). 
Together, these cross-linking experiments further validated the inter- 
face assembly of the rhodopsin-arrestin complex. 


Possible structural mechanisms for biased signalling 


The rhodopsin-arrestin complex represents the first crystal structure 
of a GPCR bound to arrestin and provides an opportunity to examine 
the mechanism of arrestin-biased signalling. Although a crystal struc- 
ture of G protein-bound rhodopsin is not available, several structures 
of rhodopsin bound to GaCT and analogue peptides have been deter- 
mined''*?*° and reveal that the arrangement of TM helices in light- 
activated rhodopsin is similar to that in the G-protein-bound B,AR 
complex, with the exception of TM6, whose outward movement in 
B,AR is much more pronounced upon binding to G protein®. The 
arrestin-bound rhodopsin has its intracellular end of TM6 moved 
outward by approximately 10 A relative to its inactive structure 
(Fig. 6a, b and Extended Data Fig. 9). This is in contrast to the 14A 
outward movement of TM6 reported in the G-protein-bound B,AR 
complex*. Compared to the active conformation of rhodopsin bound 
to GaCT peptides''**”**’, arrestin-bound rhodopsin has additional 
conformational differences in TM1, TM4, TM5, and TM7 (Fig. 6c, d 
and Extended Data Fig. 9), and these unique structural features may 
constitute essential elements for arrestin-biased signalling. 

The molecular assembly observed in the rhodopsin-arrestin 
complex also provides a general model for arrestin recruitment by 
phosphorylated rhodopsin-like class A GPCRs. In the computational 
model of the full complex, the highly cationic N-terminal domain of 
arrestin is paired with the C-terminal tail of rhodopsin (Extended 
Data Fig. 10). Based on the extensive disulfide crosslinking data and 
computation modelling, phosphorylated $334, S338 and $343 can 
form tight ionic interactions with three positively charged pockets 
at the N terminus of arrestin (Extended Data Fig. 1la-d). These 
results support a model of arrestin activation by phosphorylated rho- 
dopsin through the C-tail exchange mechanism (Fig. 6e)?. The 
displacement of the arrestin C terminus by the phosphorylated rho- 
dopsin C-tail destabilizes the polar core of arrestin®, thus allowing for 
a 20° rotation of the N- and C- domains of arrestin that opens a cleft 
between the middle and C-loops into which the ICL2 helix of rho- 
dopsin can fit. The ionic interaction between rhodopsin and arrestin is 
consistent with the fact that it is highly salt sensitive in our 
AlphaScreen assay (Extended Data Fig. 1le), in agreement with the 
salt-sensitive binding of phosphorylated rhodopsin to arrestin****. 
Importantly, the cytoplasmic face of the rhodopsin TM bundle is 
highly positively charged, whereas the finger loop (residues 70-78) 
contains three conserved negatively charged residues (E71, D72, and 
D74) (Extended Data Fig. 10). Thus, the interaction of arrestin with 
the rhodopsin TM bundle is mediated not only by shape but also by 
charge complementarity. Arrestins are highly conserved with only 
four subtypes in vertebrates. In contrast, there are hundreds of 
GPCRs, with cytoplasmic interfaces that are mainly non-conserved. 
However, the positive charge property is a common feature on the 
cytoplasmic side of a number of GPCR structures (Extended Data 
Fig. 12). Electrostatic interactions between arrestins and GPCRs 
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Figure 6 | Structural basis of arrestin-biased signalling and arrestin 
recruitment. a, b, Two views of structural overlays of arrestin-bound 
rhodopsin (green) with inactive rhodopsin (pink). ¢, d, Two views of structural 
overlays of arrestin-bound rhodopsin (green) with GaCT peptide-bound 
rhodopsin (orange). e, A cartoon model of arrestin recruitment by a 
phosphorylated and active rhodopsin. In the dark state, the receptor is inactive 
(R-state) and arrestin is in the closed state (basal state). Receptor activation and 
phosphorylation (P-R* state) allow the phosphorylated C-terminal tail of 
rhodopsin to bind to the N-domain of arrestin (pre-activated state), thus 
displacing the arrestin C-terminal tail. This displacement destabilizes the polar 
core of arrestin, which allows a 20° rotation between the arrestin N- and 
C-domains, leading to the opening of the middle loop (ML) and C-loop (CL) to 
accommodate the ICL2 helix of rhodopsin (fully activated state). The activated 
receptor also opens the cytoplasmic side of the TM bundle to adopt the 
finger loop (FL) of arrestin. In this model, the tip of arrestin’s C-domain 
contacts the membrane (red asterisk). 


may represent an adaptive mechanism for arrestins to pair promis- 
cuously with the large number of GPCRs. 

The asymmetric orientation of the bound arrestin with regard to 
the relative positions of its N-C domains in respect to the membrane 
has important implication in its binding to rhodopsin (Fig. 2a, b). 
Such asymmetric assembly brings the arrestin C-domain towards the 
membrane, with the C-edge either being touched or embedded in the 
membrane layer (Extended Data Fig. 13). The C-edge is comprised of 
conserved hydrophobic residues (F197, F198, M199, F339, and L343). 
It has been puzzling why single alanine mutations at these residues 
would affect arrestin binding to rhodopsin given how far away they 
are from the receptor’. The close proximity of these hydrophobic 
residues to the membrane surface may provide an explanation for 
the effects of these mutations on rhodopsin binding. GPCR signalling 
regulator proteins are normally membrane-associated through lipid 
modifications, as is the case for GPCR kinase 1 (GRK1) and the 
G-protein subunits Ga and GBy. Yet, there is no known lipid 
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modification for any of the arrestins. We speculate that the conserved 
hydrophobic patch at the C-tip of arrestin may function as a lipid- 
interacting module that helps to stabilize its interaction with the 
receptor. Furthermore, one primary function of arrestin is to mediate 
endocytosis of ligand-activated GPCRs and the highly asymmetric 
nature of the rhodopsin-arrestin assembly may facilitate the mem- 
brane curvature for subsequent endocytotic processes. Alternatively, 
the remote C-tip could serve as the binding site of a second rhodopsin, 
which has been proposed to form dimers in the rod outer-segment 
disc membrane”. 

LCP-SFX is a new technology that has been used to determine 
several crystal structures*****°. Rhodopsin-arrestin is a challenging 
membrane protein complex and obtaining a structure of this complex 
at a sufficiently high resolution was an intractable task using existing 
methods that include synchrotron-based crystallography and cryo- 
electron microscopy”. The rhodopsin-arrestin complex structure 
reported here demonstrates the utility of X-ray lasers when combined 
with SFX and an LCP crystal delivery system**. The SFX method is 
relatively new and under continuous development. Given its success 
in solving the rhodopsin-arrestin structure, we expect that X-ray 
lasers, with further method development, will continue to provide 
breakthrough insights into biology and chemistry. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


No statistical methods were used to predetermine sample size. 

Protein preparation. We used human rhodopsin and mouse visual arrestin-1 in 
this study. The T4L-rhodopsin-arrestin fusion protein was expressed using a 
tetracycline-inducible expression cassette encoding a fusion protein with 
Hisgs-MBP-MBP followed by a 3C protease cleavage site at the N terminus of 
the T4L-rhodopsin-arrestin. In this engineered construct, we have fused a 
cysteine-free T4L (residues 2-161 with C54T and C97A) to the N terminus of 
a rhodopsin that contains four mutations: N2“'*™"C and N282"°'3C to form a 
disulfide bond, and E113°7*Q and M257°*°Y for constitutive receptor activity. 
The C terminus of rhodopsin was fused to 3A arrestin (L374A, V375A, F376A, 
residues 10-392) with a 15 amino acid linker (AAAGSAGSAGSAGSA). 

The fusion protein constructs were expressed in HEK293S cells (Invitrogen) 
transiently transfected using Lipofectamine 2000 (Invitrogen). Cells were trans- 
fected at a density of 2 X 10° cells per ml at a 100 ml scale in SEM4TransFx-293 
(HyClone). Six hours post-transfection, cells were diluted tenfold with 
CDM4HEK293 medium (HyClone). When the cell density reached approxi- 
mately 4 X 10° cells per ml, protein expression was induced by the addition of 
doxycycline to a final concentration of 1 1g ml’. After 24h induction, cells were 
harvested, resuspended in hypotonic buffer (20 mM HEPES-Na, pH 7.5, 10 mM 
NaCl, 10 mM MgCl,) supplemented with EDTA-free protease inhibitor cocktail 
(Roche), followed by Dounce homogenization. The lysate was centrifuged at 
45,000 r.p.m. at 4°C for 1h. The membranes were solubilized in 20 mM Tris- 
HCl, pH7.4, 100 mM NaCl, 10% glycerol, 0.5% (w/v) n-dodecyl-B-p-maltopyr- 
anoside (DDM, Anatrace), 0.1% (w/v) cholesteryl hemisuccinate (CHS, 
Anatrace), and protease inhibitor cocktail for two hours at 4 °C. The supernatant 
was isolated by centrifugation at 45,000 r.p.m. for one hour, and incubated with 
amylose beads (New England Biolabs) at 4 °C overnight. Typically 10 ml of resin 
were used for supernatant from one litre of the original cell culture. The resin was 
washed with 200 ml of washing buffer (10 mM Tris-HCl, pH 7.4, 100 mM NaCl, 
0.005% (w/v) lauryl maltose neopentyl glycol (MNG-3, Anatrace), 0.001% (w/v) 
CHS) and the protein was eluted with washing buffer containing 10 mM maltose. 
The eluted fusion protein was concentrated to 40-50 mg per 1 ml and the Hisg- 
MBP-MBP tag was cleaved by overnight incubation with 3C protease at 4 °C. The 
cleaved Hiss~MBP-MBP tag was removed by incubating with 300 pl Ni>*-NTA 
beads (Qiagen) for three hours at 4 °C. The purified T4L-rhodopsin-arrestin was 
collected and concentrated to 30 mg ml for crystallization. 

Wild-type rhodopsin and rhodopsin mutants E113*78Q and E113°78 
Q/M257°“°Y used for MBP pull-down and AlphaScreen assays were expressed 
from the same vector with a Hiss—MBP tag at the N terminus. Protein expression 
and purification were similar as for the rhodopsin-arrestin fusion protein used 
for crystallization, with the difference that proteins were not eluted, but remained 
bound to beads. 

To generate biotinylated proteins for the AlphaScreen assays, wild-type and 
3A arrestin (residues 10-392) open reading frames were cloned with N-terminal 
avitag-MBP tag into the first expression cassette of a modified pET-Duet express- 
ion vector (Novagen), and the biotin ligase gene BirA was cloned into the second 
cassette. The 14 amino acid avitag contains a single lysine that is efficiently 
biotinylated in vivo by the BirA protein®'. BL21 (DE3) cells transformed with 
the expression plasmid were grown in LB broth at 16 °C to an ODgoo of ~1.0 and 
induced with 0.1 mM IPTG in the presence of 40 1M biotin for 16h. Cells were 
harvested, resuspended in 50 ml extraction buffer (20mM Tris-HCl, pH 8.0, 
150 mM NaCl, and 10% glycerol) per two litres of cells, and passed three times 
through a French Press with pressure set at 1,000 Pa. The lysate was centrifuged at 
16,000 r.p.m. in a Sorvall SS-34 rotor for 30 min, and the supernatant was loaded 
on a 5ml amylose HP column (GE Healthcare). The column was washed with 
100 ml of wash buffer (20 mM Tris-HCl, pH 8.0, 150 mM NaCl, and 10% glycerol) 
and eluted in buffer containing 20mM Tris-HCl, pH 8.0, 150mM NaCl and 
20 mM maltose. The eluted biotin-MBP-arrestin was concentrated and further 
purified by size-exclusion chromatography through a HiLoad 16/60 Superdex 
200 column (GE Healthcare) in 20mM Tris-HCl, pH 8.0, and 150mM NaCl. 
Monomeric protein was collected for further assay. 

Untagged 3A arrestin (residues 10-392) was expressed as a Hiss-SUMO fusion 
protein from the expression vector pSUMO (LifeSensors). The expression and 
purification of untagged arrestin followed the same method as for biotin-MBP- 
arrestin, except that the Hiss-SUMO tag was cleaved overnight with SUMO 
protease” at a protease/protein ratio of 1:1,000 in the cold room. 

MBP pull-down assay. Arrestin was cloned into the pCITE-4a vector (Novagen) 
to allow for transcription from a T7 promoter. The TNT Quick Coupled 
Transcription and Translation kit was used according to the manufacturer’s 
protocol (Promega), to express [°°S]methionine-labelled wild-type arrestin and 
3A arrestin (L374A, V376A, and F376A). Radiolabelled wild-type arrestin and 


mutant arrestin proteins were incubated with Hiss-MBP-rhodopsin fusion pro- 
tein immobilized to 50 jl of maltose agarose bead suspension. Proteins and beads 
were incubated at 4°C for one hour on a rotating platform in binding buffer 
containing 20mM Tris-HCl, pH 7.4, 100mM NaCl and 0.02% DDM/0.004% 
CHS. The beads were then washed three times with binding buffer and resus- 
pended in 400 jl elution buffer (20 mM Tris-HCl, pH 7.4, 100 mM NaCl, 0.02% 
DDM/0.004% CHS and 100mM maltose). The eluates were concentrated and 
incubated at room temperature for 15 min with 2 X loading dye. Samples were 
separated on 12% sodium dodecyl sulfate (SDS)-denaturing polyacrylamide 
gels. Gels were stained with Coomassie R-250, dried at 70°C for 90 min, and 
exposed overnight to a phosphor storage screen. Results were visualized on a 
PhosphorImager (Fuji). 

Assays for the interactions between rhodopsin and arrestin or G protein 
peptide. Interactions between rhodopsin and arrestin were assessed by lumin- 
escence-based AlphaScreen assay (Perkin Elmer), which our group has used 
extensively to determine ligand-dependent protein-protein interactions of nuc- 
lear receptors. The AlphaScreen principle is illustrated in Extended Data Fig. 1c. 
Briefly, biotinylated arrestin was bound to streptavidin-coated donor beads and 
His8-tagged rhodopsin was bound to nickel-chelated acceptor beads. The donor 
and acceptor beads were brought into close proximity by the interactions between 
rhodopsin and arrestin, which were measured in the presence or absence of all- 
trans-retinal (Sigma). When excited by a laser beam of 680 nm, the donor bead 
emits singlet oxygen that activates thioxene derivatives in the acceptor beads, 
which releases photons of 520-620 nm as the binding signal. The experiments 
were conducted with 100 nM of rhodopsin and arrestin proteins in the presence 
of 5 tg ml~! donor and acceptor beads in a buffer of 50 mM MOPS-Na, pH 7.4, 
50 mM NaF, 50 mM CHAPS, and 0.1 mg ml ~ ' bovine serum albumin. The results 
were based on an average of three experiments with standard errors typically less 
than 10%. GaCT-HA (TGGRVLEDLKSCGLEF) and biotinylated GuCT-HA were 
synthesized by Peptide 2.0. For the competition assay, different amounts of 
untagged arrestin or GxCT-HA were added to the reaction to compete with 
tagged arrestin or GuCT-HA for rhodopsin binding. 

Thermal stability assay. The thermal stability assay was performed with the 
thiol-specific fluorochrome N-[4-(7-diethylamino-4-methyl-3-coumariny)phe- 
nyl]maleimide (CPM) as described previously*’. Briefly, 10 ug of protein was 
diluted with dilution buffer (20mM Tris-HCl, pH7.5, 200 mM NaCl, 0.005% 
MNG-3/CHS) to 195 ul, while CPM dye stock (4 mg ml ' in DMSO) was freshly 
diluted to 0.2 mg ml’ in the dark. After 5 min of incubation at room temperature 
for both protein and CPM dye separately, 5 pl of diluted CPM dye was added to 
the protein sample and the protein/dye mix transferred immediately into a sub- 
micro quartz fluorometer cuvette (Starna Cells) and measured in a Cary Eclipse 
spectrofluorometer (Agilent). Assays were performed from 20 °C to 80 °C with a 
slope of 2°C increase per minute at an excitation wavelength of 387 nm and an 
emission wavelength of 463 nm. All data were processed using GraphPad Prism 
and fit using the Boltzmann sigmoidal equation to determine the melting tem- 
perature (T,,,) as inflection point of the melting curves. 

Crystallization. T4L-rhodopsin-arrestin crystals were grown in lipid cubic 
phase (LCP)™. Protein solution (~30 mg ml ') was mixed with monopalmito- 
lein (9.7 MAG, from Nu-Chek Prep, Inc.) containing 10% cholesterol at a 1:1 ratio 
(w/w) using a coupled syringe mixer* and 50 nl boluses of protein-laden LCP 
were dispensed on 96-well glass sandwich plates (Molecular Dimensions or 
Marienfeld-Superior) and overlaid with 0.841 precipitant solutions using 
a Gryphon LCP robot (Art Robbins Instruments) or an NT8-LCP robot 
(Formulatrix). Multiple initial hits were identified by using screens of 30% PEG 
400 in combination of 100 mM or 400 mM salts from the StockOptions Salt kit 
(Hampton Research)”*. Crystals that reached full size (around 10-20 tm) within 
four days at 20°C were harvested from the mesophase and were flash frozen in 
liquid nitrogen without additional cryoprotectant. Crystals used for synchrotron 
data collection were grown in 0.05 M magnesium acetate, 0.05 M sodium acetate, 
pH5.0 and 28% PEG 400. 

Crystals for LCP-SFX were prepared in 100 ul gas-tight Hamilton syringes as 
described***’. About 5 jl of protein-laden LCP in the presence of fivefold molar 
excess of all-trans-retinal was slowly injected into 60 ul mother liquid buffer 
(0.15 M ammonium phosphate, pH 6.4 and 32% PEG 400) using a coupled syr- 
inge mixing device’’. Crystals were grown in several syringes at 20 °C, consoli- 
dated and transferred into the LCP injector® for XFEL diffraction data collection. 
The average crystal size was 5-10 um as determined under a polarized light 
microscope. The phase diagram for 9.7 MAG suggests that this MAG is a suitable 
host lipid for extruding LCP in vacuum at 20°C, where evaporative cooling 
created problems when 9.9 MAG was used as a host LCP lipid”. 

Data collection (synchrotron). A partial 8.0 A synchrotron data set was collected 
at 100 K using an X-ray beam at the wavelength of 1.0 A at the 21 ID-D beam line 
of LS-CAT and at 23-ID-D of GM-CAT at the Advanced Photon Source at 
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Argonne National Laboratory. A full 7.7 A data set was collected from a single 
crystal (~20 1m in size) using 101m beam size and 0.1s exposures per 0.1° 
oscillation with a Pilatus 6M pixel detector at the X10SA beam line at the Swiss 
Light Source. The observed reflections were reduced, merged, and scaled with 
XDS* with statistics shown in Supplementary Table 1a. The L-test plot of the 
7.7 A data set is consistent with a perfectly twinned crystal. 

Data collection (XFEL). LCP-SFX experiments were carried out at the Coherent 
X-ray Imaging (CXI) instrument” at the Linac Coherent Light Source (LCLS) in 
the SLAC National Accelerator Laboratory (Menlo Park, California, USA). X-ray 
pulses of 50 fs duration at a wavelength of 1.3 A (9.5 keV) were attenuated to ~3% 
(3 X 10° photons per pulse) and focused to ~1.5 jim diameter at the interaction 
point using Kirkpatrick-Baez mirrors”. Rhodopsin-arrestin complex crystals in 
LCP were injected across the XFEL beam using an LCP injector** with a 50 um 
diameter nozzle at a flow rate of ~0.2 pl min’. Diffraction patterns were col- 
lected at 120 Hz using the Cornell-SLAC Pixel Array Detector (CSPAD). Over 
5 million data frames were collected corresponding to ~12h of data acquisition 
time. Of these frames, ~0.45% images contained potential crystal hits as iden- 
tified using Cheetah®' (more than 40 Bragg peaks of 1-20 pixels in size and a 
signal to noise ratio better than 6 after local background subtraction). Of the 
potential crystal hits, 18,874 diffraction patterns could be auto-indexed by 
CrystFEL** using a combination of MOSFLM”, XDS* and DirAx®’. An integ- 
ration radius of only two pixels was used to avoid overlapping with neighbouring 
peaks due to the high spot density resulting from the large unit cell dimensions. 
Partial reflections from different crystals in random orientations were merged 
using a Monte Carlo integration across the crystal rocking curve of each 
reflection®’. The resolution was anisotropic with ~3.3A resolution along 
the c*-axis and ~3.8 A resolution along the a*/b* axes. The data used for the 
structure refinement were truncated at 3.8 A/3.8 A/3.3 A using the get_hkl pro- 
gram of CrystFEL*® based on the criteria of data correlation coefficient (CC*), 
which is 0.87 at the highest resolution shell (Supplementary Table 1a). The use 
of CC* of 0.5 as resolution cutoff has been recently recommended for X-ray 
diffraction. The resolution cutoff for several published XFEL structures follows 
this criterion, including the 5HT2, GPCR XFEL structure*®, which has a CC* of 
0.74. The statistics of the final data used in structure refinement are shown in 
Supplementary Table 1b. 

Structure determination. The XFEL data were initially merged according to the 
apparent Laue group of 4/mmm, and molecular replacement searches were per- 
formed in all possible space groups of 4/m and 4/mmm. The best structure 
solution was found in P43. Based on analysis of the Zanuda program”, we deter- 
mined that the most likely space group was P2,2,2, and that the crystals were 
physically twinned. The data were reprocessed with the Laue group of mmm and 
molecular replacement searches were performed in P2)2)2;. This space group 
assignment resulted in the best statistics and map quality out of several possible 
space groups (Supplementary Table 2). 

The crystals appeared to be pseudo-merohedrally twinned based on L-test 
analysis’. Despite the challenge of twinned data, the rhodopsin-arrestin complex 
structure was solved by the molecular replacement method implemented in 
Phaser® using the models of constitutively active rhodopsin, pre-activated 
arrestin, and T4L (PDB codes: 444M”, 4J2Q', and 3SN6%, respectively). Four 
molecules of rhodopsin and four molecules of arrestin were found sequentially by 
molecular replacement search, resulting in four very similar rhodopsin-arrestin 
assemblies. Four T4Ls were also found in the aqueous layer with its C-terminal 
residue in a position to form a covalent bond with the first residue of rhodopsin, 
supporting the correct positioning of T4Ls by molecular replacement. 

The structure was initially refined against the XFEL data without twin law to an 
Reree-factor of ~36% and the model maps from the data were of sufficient quality 
to interpret the overall structure of the rhodopsin-arrestin complex (Extended 
Data Fig. 3). The model then underwent iterated cycles of manual building into 
2F, — F, maps with Coot® and refinement with REFMAC” and the PHENIX”!, 
where rigid body, individual position, group B-factor, and TLS refinements were 
used along with NCS restraints and twin law (k, h, -l). The arrestin residues 70-78 
and 165-175, which were not included in the molecular replacement model, were 
manually placed into the density map. These two regions of arrestin became 
visible because they are either engaged in direct interaction with rhodopsin or 
involved in crystal packing. Regions with poor density were removed from the 
final model, including T4L from the B complex and the N-domain of T4L (resi- 
dues 13-57) from the C complex. We did not observe clear electron density for the 
all-trans-retinal, which was thus not included in the structure. The structure has 
been carefully refined to the final state that has excellent geometry and refinement 
statistics (Supplementary Table 1b). Ramachandran plot analysis indicates that 
100% of the residues are in favourable or allowed regions (no outliers). The final 
structure was validated with MolProbity, which revealed an all-atom-clash score 
of 1.47 and MolProbity score of 1.13”. 
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The real-space correlation coefficients against a 2mF, — DF- map for each 
chain of the structure on a per residue basis using the CCP4 EDSTATS program” 
or the MolProbity program in Phenix indicated an overall good fit between the 
structure and the electron density map. The density fit correlation in Coot was 
low with the structure from Phenix twinned refinement because the map from the 
MTZ file with map coefficients produced by phenix.refine was not in absolute 
scale. This problem was overcome by using the 2F, — F. CNS format map from 
phenix.refine, which generated a normal correlation in Coot, similar to those 
from EDSTATS and MolProbity. All structural figures were prepared using 
PyMOL’.. 

Cell-based assays for rhodopsin-arrestin interactions (Tango assays). For the 
cell-based Tango assay”, we generated fusion constructs consisting of rhodopsin 
(1-321), a tobacco etch virus (TEV) protease cleavage site (TEV site), and the 
transcriptional activator tTA (Rho-TEV site-rTA) as well as of arrestin (10-392) 
and TEV protease (Arr-TEV protease). HTL cells were a gift from G. Barnea and 
R. Axel (Brown University and Columbia University). 10 ng Rho-TEV site-tTA 
construct were transfected together with 10ng Arr-TEV protease plasmid and 
lng of phRG-tk Renilla luciferase expression vector into HTL cells using 
Xtremgene (Roche). One day after transfection, cells were induced by vehicle 
(DMSO) or all-trans-retinal (10 1M) overnight. Cells were harvested and lysed 
in Passive Lysis Buffer (Promega). Luciferase activity was measured using the 
Dual Luciferase Kit (Promega) according to manufacturer’s instructions. 
Mutagenesis. Site-directed mutagenesis was carried out using the QuikChange 
method (Agilent). Mutations and all plasmid constructs were confirmed by 
sequencing before protein expression, MBP-pull down assay, Tango assay, and 
AlphaScreen assay. 

Electron microscopy studies of the rhodopsin-arrestin complex. Samples were 
prepared as previously described’’. Briefly, the sample was applied to a freshly 
glow-discharged carbon coated copper grid and allowed to adhere for 10s before 
being reduced to a thin film by blotting. Immediately after blotting 3 pl of a 1% 
solution of uranyl formate was applied to the grid and blotted off directly. This 
was repeated three times. Data were acquired using a Tecnai F20 Twin transmis- 
sion electron microscope operating at 200kV, with a dose of ~40e7 A~? and 
nominal underfocus ranging from 2 to 3 um. Images were automatically collected 
at a nominal magnification of 62,000 X and pixel size of 0.273 nm. All images 
were recorded with a Tietz F416 4k X 4k pixel CCD camera using Leginon data 
collection software’”’. Experimental data were processed by the Appion software 
package’’, which interfaces with the Leginon database infrastructure. ~6,000 
particles were automatically extracted from 54 electron microscopy micro- 
graphs’* and the particle stack was then aligned and sorted using the XMIPP 
reference-free maximum likelihood alignment”. Several exemplary 2D averages 
are shown in Fig. le, which were derived from class averages for the complex with 
or without T4L, computed from ~14,000 particles selected from 155 electron 
microscopy-micrographs. 

In-cell disulfide bond cross-linking. The open reading frames of full-length 
arrestin with C-terminal Flag tag and full-length rhodopsin with C-terminal 
haemagglutinin (HA) tag were cloned into pcDNA6. Cysteine mutations 
(41 for arrestin and 51 for rhodopsin) were systematically introduced into 
arrestin and rhodopsin in these two DNA vectors. AD293 cells were split one 
day before transfection at 50,000 cells per well in a 24-well plate. Cells were grown 
for one day, then transfected with 100 ng rhodopsin constructs (pcDNA6-rho- 
3HA) plus 100 ng arrestin plasmid (pcDNA6-Arr-3Flag) by Lipofectamine 2000 
(DNA/Lipofectamine 2000 ratio of 1:2) in each well. Cells were grown for 2 days 
after transfection, and were then treated at room temperature with H,O2, which 
was freshly diluted in the cell culture medium to a final concentration of 1 mM. 
After 5min treatment with H,O,, the medium was aspirated and 100 ul of 
CelLytic M (Sigma C2978) were added to each well and the plate was shaken 
for 10 min at room temperature. Cell lysates were transferred to 1.5 ml tubes and 
spun at 16,000g at 4°C for 5 min. The supernatants (10 pil) were mixed with an 
equal volume of 2 X SDS loading buffer (without reducing agents) for 5 min at 
room temperature, and loaded onto a protein gel for western blot analysis. 
Horseradish peroxidase-conjugated anti-Flag (Sigma M2) and anti-HA (Sigma) 
antibodies were used to probe for free and cross-linked arrestin and rhodopsin 
proteins. 

Hydrogen-deuterium exchange mass spectrometry (HDX). HDX was carried 
out as described previously*°, with the following modifications: (1) the solution 
handling and mixing was performed with a LEAP Technologies Twin HTS PAL 
liquid handling robot housed inside a temperature-controlled cabinet held at 4 °C 
and (2) decyl maltose neopentyl glycol (DMNG) was used in place of DDM in the 
exchange buffer. Briefly, all stock solutions and dilutions were made using the 
7TM HDX buffer (50mM HEPES (pH7.5), 150mM NaCl, 2% (v/v) glycerol, 
0.01% (m/v) CHS and 0.05% (m/v) DMNG in either H,O or in D,O for on- 
exchange). All HDX protein stock solutions were prepared at 15 1M in the 7TM 
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HDX H;0 buffer. On-exchange was carried out in triplicate for predetermined 
times (10, 30, 60, 900 and 3,600 s) at 4 °C by mixing 5 ul of stock protein solution 
with 20 pl of D,O on-exchange buffer. Exchange was quenched by adding 25 kl of 
quench solution (100 mM NaH,PO,, 0.02% DMNG, and 15 mM TCEP at pH 2.4) 
to the reaction. Digestion was performed in line with chromatography using an 
in-house packed pepsin column. Peptides were captured and desalted on a C8 
trap. Peptides were then separated across a 5 10 X 1mm Betasil C8 column 
(Thermo Fisher Scientific) with a linear gradient of 12-40% acetonitrile in 
0.3% formic acid over a short 5 min gradient to limit back exchange with the 
solvent. 

Mass spectra were acquired in the range of m/z 300-2,000 at a resolution of 

60,000 for 8 min in positive ion mode on a Q Exactive mass spectrometer 
(Thermo Fisher Scientific) equipped with an ESI source operated at a capillary 
temperature of 225 °C and spray voltage of 3.5 kV. The intensity weighted average 
m/z value (centroid) of each peptide’s isotopic envelope was calculated with the 
Workbench program*! and converted to % deuterium values. Back-exchange 
correction was based on an estimated 70% deuterium recovery and accounting 
for the known 80% deuterium content of the on-exchange buffer. The 
Workbench software used P values lower than 0.05 for two consecutive time 
points to determine significance. Sequence coverage experiments for these pro- 
teins were carried out in the 7TM HDX buffer and LC system described above, but 
with a longer 60 min gradient. 75 pmol of protein were loaded onto the column. 
For sequencing, tandem mass spectra were obtained using data-dependent 
acquisition with 30s dynamic exclusion, where the top five most abundant ions 
in each scan were selected and subjected to CID fragmentation. Each scan was the 
average of 3 microscans under normal scan mode in both MS and MS/MS. 
Peptides were identified by searching spectra against an in-house database using 
the mascot search engine as described previously”. 
Generation of a stable cell line expressing bovine rhodopsin mutant Y74C/ 
C140S/C316S and rhodopsin preparation. The rhodopsin mutant Y74C/ 
C140S/C316S gene’ was PCR-amplified from the pMT vector by using two 
flanking primers containing Nhel and NotI restriction sites and subcloned into 
vector PB-T-PAF®. The insert in the resulting plasmid termed PB-Rho-Y74C was 
verified by automated DNA sequencing. 

Generation of the stable cell line as well as rhodopsin production and purifica- 
tion was performed as described**. HEK293S GnTI cells* were cultured in 
DMEM/F12 (Wisent) supplemented with 10% heat inactivated FBS (Life 
Technologies) and 100 U ml“ penicillin/streptomycin (Life Technologies) and 
incubated at 37 °C, 5% CO. For transfection, cells were plated in 6-well plates at a 
density of ~600,000 cells per well. The following day, cells were co-transfected 
with 4 jg of PB-Rho-Y74C, 0.5 ug of PB-RB® and 0.5 pg of pCyL43°*** using 
JetPrime (PolyPlus, France) reagent following the manufacturer’s instructions 
and medium was replaced by fresh medium 4h later. (Plasmids PB-T-PAF and 
PB-RB were provided by James Rini, University of Toronto, Canada, and pCyL43 
by the Wellcome Trust Sanger Institute, UK). Two days after transfection, cells 
were transferred to a 100-mm tissue culture plate. Dual drug selection was started 
the following day, using 10 pg ml * puromycin (Bioshop, Canada) and 5 pg ml 
blasticidin (Bioshop, Canada) and lasted for 2 weeks. Cells were then transferred 
to larger flasks and finally to pleated roller bottles (Thermo Scientific, USA) 
containing 250 ml of DMEM/F12 supplemented with 10% heat-inactivated FBS 
and 100 Uml' of penicillin and 100 pg ml’ streptomycin. Roller bottles were 
incubated at 37 °C, 5% CO, and rotated at 0.1 r.p.m. Expression was induced after 
5-6 days by replacing medium with 250 ml of fresh medium containing 1 1g ml" 
doxycycline (Bio Basic) and 1 pg ml“! aprotinin (Bioshop Canada). On day 3 
after induction, cells were harvested, using PBS-EDTA with protease inhibitor 
tablet (Roche) to detach cells. Cell pellets were flash frozen in liquid nitrogen and 
stored at —80°C until purification. 

For purification, cell pellets with rhodopsin mutant Y74C/C140S/C316S were 
incubated with 11-cis retinal to reconstitute rhodopsin which was purified and 
spin labelled with 1-oxyl-2,2,5,5-tetramethyl-3-pyrroline-3-methyl metha- 
nethiosulfonate (MTSSL) as previously described’. 

Incorporation of spin labelled rhodopsin into nanodiscs. Nanodiscs were pre- 
pared by mixing cholate solubilized lipid (70% POPC + 30% POPS), scaffolding 
protein MSP1E3D1*, and rhodopsin (in 9)mM B-OG) in a molar ratio of 
140:1:0.1. The OG and cholate were removed from the mixture by dialysis against 
buffer D (20mM MOPS, 150mM NaCl, pH6.8). The nanodiscs were further 
purified by passing them over a nickel NTA column to remove any receptor 
not incorporated into the discs. Upon elution from the nickel column with 
imidazole, the nanodiscs were buffer exchanged into buffer D containing 10% 
glycerol. 

Spin labelling of arrestin mutants. C-terminally truncated mutants of arrestin, 
lacking endogenous cysteines, have been shown to bind to non-phosphorylated 
rhodopsin®. Single-cysteine mutants in this background were expressed in E. coli, 


as described®, spin labelled in buffer D overnight using a tenfold molar excess of 
MTSSL. The base mutant also contained two alanine substitutions (F85A and 
F197A) which have been shown to disrupt arrestin-1 oligomerization”. Non- 
covalently bound spin label was removed from the sample by extensive washes 
with buffer D using Amicon 10 kDa concentrators. 

DEER spectroscopy of the rhodopsin-arrestin complex. Preparations of rho- 
dopsin (Y74241C) and bovine arrestin (S60C, V139C, and L240C, which corre- 
spond to the mouse arrestin T61C, V140C, and $241C) and their spin labelling 
were performed as described previously*!. For DEER measurements, the spin- 
labelled proteins were mixed in 1:1 ratio in the dark and loaded into quartz 
capillaries (1.5 mm internal diameter and 1.8 mm outer diameter). The samples 
were irradiated for 30 s within the capillaries using a tungsten light source with a 
500 nm cutoff filter. Immediately after irradiation, the samples were flash frozen 
in liquid nitrogen, and loaded into an EN 5107D2 resonator for Q band DEER 
measurements. Measurements were performed at 80K on a Bruker Elexsys 
580 spectrometer with a Super Q-FTu Bridge. A 36-ns n-pump pulse was applied 
to the low field peak of the nitroxide field swept spectrum, and the observer 1/2 
(16ns) and m (32ns) pulses were positioned 50 MHz (17.8G) upfield, which 
corresponds to the nitroxide centre line. Model-free distance distributions were 
obtained from the raw dipolar evolution data using the LabVIEW (National 
Instruments) program “LongDistances” that can be downloaded from http:// 
www.biochemistry.ucla.edu/biochem/Faculty/Hubbell/. 

To estimate the median distances, the distance distributions were integrated 
and normalized to the maximum amplitude. The median distance was estimated 
as that corresponding to 0.5 of the integrated intensity. The modelled distances 
between nitroxide spin labels are based on the crystal structure of the rhodopsin- 
arrestin complex. R1 nitroxide side chains were modelled into the structure using 
common RI rotamers”””?. 

9.7 MAG/water temperature-composition phase diagram. The phase diagram 
was constructed based on small- and wide-angle X-ray scattering measurements 
made in the heating direction. Sample preparation and X-ray scattering measure- 
ments and analysis were as previously described*”’. The phases identified include 
the lamellar crystalline (Lc) or solid phase, the fluid isotropic (FI) or liquid phase, 
and the following liquid crystalline phases: lamellar liquid crystal (L,,), cubic-Ia3d 
and cubic-Pn3m. A separate aqueous phase observed in equilibrium with the solid 
or liquid crystalline phases is indicated by Aq. The phase diagram shows that the 
solid Lc phase stabilizes under equilibrium conditions below ~8 °C. The latter is 
some 10 °C below that observed with 9.9 MAG (monoolein)” and is similar to what 
was found with 7.9 MAG’. This low solidification temperature enabled use in the 
current project of 9.7 MAG as the host lipid for LCP-SFX data collection in an 
evacuated sample chamber at 20 °C, where evaporative cooling created problems 
for measurements with 9.9 MAG but not with 7.9 MAG*. The maximum water 
carrying capacity of 9.7 MAG resides at ~50%(w/w) water, which is considerably 
greater and smaller than that of 9.9 MAG” and 7.7 MAG”, respectively. These 
observations indicate that the cubic mesophase of 9.7 MAG has larger aqueous 
channels compared to 9.9 MAG that are more like those of 7.7 MAG. This is 
consistent with 9.7 MAG supporting the growth of rhodopsin-arrestin-T4L crys- 
tals where the complex has sizable extra-membrane bulk best accommodated in a 
large aqueous channel. This parallels the rational use of 7.7 MAG as a host lipid for 
the B.AR-Gs complex crystallization and structure determination®. 

Molecular modelling of the full-length rhodopsin-arrestin complex. Energy- 
based conformational modelling of the rhodopsin-arrestin complex was per- 
formed with the ICM-Pro molecular suite”, using a global energy optimization 
procedure similar to the one described recently for modelling of the full-length 
complex of CRFR1”*. Protein sequences of human rhodopsin and mouse arrestin 
were obtained from the Uniprot database (http://www.uniprot.org/). Starting 
from the crystallographically determined structure of the complex, the modelling 
procedure was used to add unresolved residues of the C terminus (residues 327- 
345) of the human rhodopsin structure, as well as missing residues in the arrestin 
loop (residues 340-342) of the mouse arrestin structure. The final model did not 
include the last 3 residues of rhodopsin (346-348), which lack well-defined cross- 
linking contacts and appear flexible. Initial conformations of the short loop in 
arrestin were predicted with the fast “build model” ICM algorithm, followed by 
extensive energy optimization in internal coordinates. Conformational optimiza- 
tion of the rhodopsin C-terminal peptide was guided by soft pairwise harmonic 
distance restraints derived from disulfide crosslinking data. The restraints intro- 
duced between CB atoms of the crosslinked residues were graded according to the 
crosslinking strength listed in Supplementary Table 5, from very strong, with the 
penalty function starting at 5 A distance, to medium at 7 A distance and very weak 
at 12 A. The C-terminal peptide conformation and conformation of the contact 
side chains of the arrestin were optimized to convergence (3 independent simula- 
tions of 10° steps) using global optimization procedure in internal coordinates 
with improved conformational energy terms for protein and peptides”. A special 
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backbone closure sampling procedure was applied to the loop regions to allow 
efficient optimization. The global optimization runs were executed in parallel ona 
Linux multicore server resulting in similar best energy conformations (<3 A 
r.m.s.d.; root mean squared deviation) for the C-terminal peptide residues. 

The best energy-optimized conformation of this region suggests that the 
extended C terminus peptide runs antiparallel along the N-terminal B-strand 
of arrestin. This conformation of the C terminus satisfied all 17 medium to strong 
disulfide crosslinking restraints for this region (Supplementary Table 5), while 
making a number of specific polar interactions and salt bridges of D331, E332 and 
E341 side chains with arrestin basic residues (Extended Data Fig. 5). These 
modelling results suggest that although the rhodopsin C terminus is rather flex- 
ible, some low energy conformations may be preferable even in non-phosphory- 
lated rhodopsin. Moreover, independent modelling of the complex with 
phosphorylated serine residues Ser334, Ser338 and Ser343 in the C terminus of 
rhodopsin resulted in a similar conformation of this domain. The interactions 
within phosphorylated complex, however, are greatly enhanced by as many as 
seven additional salt bridges between negatively charged phosphates and the 
positively charged lysine and arginine residues within the N- terminal domain 
of arrestin (Extended Data Fig. 11). 
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Extended Data Figure 1 | Constitutively active rhodopsin interacts with 
arrestin and GaCT-HA. a, SDS-PAGE of N-terminally MBP-tagged wild- 
type and mutant rhodopsin. b, Non-cropped versions of the pull-down assay 
gels shown in Fig. 1b. The interactions between mouse wild-type arrestin and 
human wild-type or E113***Q rhodopsin are very weak. In contrast, the 
interaction between constitutively active rhodopsin (E113*8Q/M257°Y) 
and pre-activated L374A/V375A/F376A arrestin (3A arrestin) is strong and is 
further increased in the presence of 10 1M all-trans-retinal. Input: 5% of the 
binding reaction. Bottom panels show the rhodopsin loading controls. 

c, Schematic representation of the AlphaScreen assay. d, AlphaScreen 
binding assay between E113°7°Q/M257°°Y rhodopsin and GaCT-HA 
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(TGGRVLEDLKSCGLHF) in the presence and absence of 5 uM all-trans-retinal. 
The two left columns show the controls with ‘peptide only’ and ‘rhodopsin 
only’. (n = 3, error bars, s.d.). e, Determination of the affinity of the interaction 
between rhodopsin E113°*°Q/M257°*°Y and GuCT-HA by homologous 
competition. Hiss-MBP-rhodopsin mutant protein was immobilized on 
Ni-acceptor beads and biotinylated GxCT-HA on streptavidin donor beads. 
Binding between rhodopsin and arrestin brings donor and acceptor beads into 
close proximity, resulting in the indicated binding signal. Non-biotinylated 
GaCT-HA competed for the interaction with an ICs9 of ~700 nM (n = 3, error 
bars, s.d.). 
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Extended Data Figure 2 | Purification and crystallization of T4L-rhodopsin- 
arrestin. a, Purification of the T4L-rhodopsin-arrestin (T4L—Rho-Arr) 
complex. Hiss-MBP-MBP-T4L-Rho-Arr complex was first purified by 
amylose column chromatography (lane 1). The Hiss~-MBP-MBP tandem tag 
was then released by cleavage with 3C protease (lane 2) and removed by binding 
to Ni-NTA beads to recover pure T4L-rhodopsin-arrestin (T4L—Rho-Arr) 
protein (lane 3). b, Analytical gel filtration profile of the T4L-rhodopsin- 
arrestin complex. T4L-rhodopsin-arrestin eluted mostly as monomers with 


a small proportion of oligomers. The molecular weights of protein standards 
are indicated at the top. c, Thermal stability shift analysis of T4L-rhodopsin-— 
arrestin. T4L-rhodopsin-arrestin is relatively stable with a Tm of 59°C. d, e, 
Crystals of T4L-rhodopsin-arrestin in lipid cubic phase under bright-field 
illumination (d) and polarized light (e). f, X-ray diffraction pattern of a T4L- 
rhodopsin-arrestin crystal recorded at LS-CAT of APS. The green ring 
indicates the position of reflections at 8.0 A resolution. 
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Extended Data Figure 3 | Electron density map for the overall complex 

and the key interfaces based on the XFEL data. a, A 2F, — F, electron density 
map contoured at lo of the arrestin finger loop, which forms the key interface 
with TM7 and helix 8.b, A 2F, — F, electron density map contoured at 1o of the 
loop between TM5 and TM6, which forms the key interface with the B-strand 
following the finger loop. c, A 3,000 K simulated annealing omit map (2F, — F, 


electron density map contoured at 1c) calculated from the 3.8 A/3.8 A/3.3 A 
XFEL data supports the overall arrangement of the rhodopsin-arrestin 
complex. In all panels, the complex structure is shown with rhodopsin coloured 
in green and arrestin in brown. d, The C-loop with a 2F, — F, composite omit 
map at 1o calculated from the 3.8 A/3.8 A/3.3 A truncated XFEL data. Key 
residues are labelled. 
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Extended Data Figure 4 | Structure similarity of the four rhodopsin- 
arrestin complexes in the asymmetric units and the interface between 
rhodopsin and arrestin. a, Two 90° views of the superposition of the four 
rhodopsin-arrestin complexes are shown as cartoon representation. The four 
complexes have an r.m.s.d. of less than 0.5 A in the Co: atoms of rhodopsin and 
arrestin. b, Close-up view of arrestin-binding sites in rhodopsin. The four 
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arrestin-binding sites (P1-P4) are highlighted in brown on the rhodopsin 
surface. The rhodopsin C-terminal tail/arrestin interface (P4) is based on 
computational modelling and disulfide cross-linking data. c, Rhodopsin- 
binding sites in arrestin. The four rhodopsin-binding sites (P1-P4) are 
highlighted in green on the arrestin surface. 
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Extended Data Figure 5 | Conformational modelling of the rhodopsin- arrestin residues identified by disulfide crosslinking. c, Same as in b, but 
arrestin full length complex. a, An overview of the computational model. showing predicted hydrogen bonding and ionic interactions for the C-terminal 
b, Predicted interactions of the rhodopsin C terminus with arrestin, showing _ residues of rhodopsin. 

strong to medium pairwise restraints between CB atoms of rhodopsin and 
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Extended Data Figure 6 | Dynamics of free 3A arrestin and rhodopsin- by mass spectrometry and the colours of the bars indicate the relative decrease 
bound arrestin determined by HDX. a, HDX perturbation map between in deuterium exchange (colour code at bottom). b, The thermal stability of free 
rhodopsin-bound arrestin and free arrestin, which is derived from the 3A arrestin and the rhodopsin-arrestin complex shows that the rhodopsin- 


difference in the HDX rate between rhodopsin-bound arrestin and free arrestin. arrestin complex is more stable than free 3A arrestin. 
The bars below the arrestin sequence represent the peptide fragments resolved 
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rhodopsin-arrestin interface. a, Cartoon illustration of the Tango assay for 


Extended Data Figure 7 | Cell-based Tango assays to validate the 


rhodopsin-arrestin interactions in cells. b, c, Mutations of key arrestin (b) and 
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Extended Data Figure 8 | Control experiments for disulfide bond cross- 
linking specificity. a, The product of the cross-linking reaction of finger loop 
residue G77C with N310”°’C of TM7 was confirmed by western blots using 
anti-Flag antibody (which detects arrestin—Flag fusion) and anti-HA antibody 
(which detects rhodopsin-HA fusion). The cross-linked products are marked 
with arrow heads, and free-arrestin and free-rhodopsin are indicated by 
asterisks. Arrestin (3A) and rhodopsin (4M) without cysteine mutations do not 
form cross-linked products. b, The cross-linked product of finger loop residue 
G77C with N310’°’C of TM7 was sensitive to treatment with reducing 


ARTICLE 


agents, indicating the cross-linking is mediated through disulfide bond 
formation. ¢, A close-up view of arrestin finger loop residues M76C and G77C 
and their cross-linking with rhodopsin, which shows that G77C was specifically 
cross-linked to N310’°’C of TM7 and Q312°49 of helix 8, and M76C was 
cross-linked to N310”°”C of TM7 and Q312°"C of helix 8, but not to other 
residues. d, Structure and cross-linking of finger loop N-terminal residues 
Q70C, E71C, and D72C of arrestin to T70C and K67C from ICL1 of rhodopsin. 
e, Structure and cross-linking of arrestin back loop residues R319C and 
T320C to Q237'°C from TMS of rhodopsin. 
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Extended Data Figure 9 | Structure comparison of the arrestin-bound (green) and G, protein-bound B,-adrenergic receptor (light yellow). c, Overlays 
rhodopsin with the B,-adrenergic receptor in complex with G, protein(PDB _ of arrestin-bound rhodopsin (green) with inactive rhodopsin (pink) reveals 
code 3SN6) and the inactive rhodopsin (PDB code 1F88). a, Superposition of — specific conformational changes in each TM helix. The arrows indicate outward 
arrestin-bound rhodopsin (green) with G, protein-bound , adrenergic movements of TM helices. d, r.m.s.d. of Ca atom differences between arrestin- 
receptor (light yellow). The major conformational changes are indicated by bound rhodopsin and inactive rhodopsin shows the large conformational 
arrows. b, An intracellular view of a superposition of arrestin-bound rhodopsin changes in TM5 and TM6. 
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Extended Data Figure 10 | Structure of rhodopsin-bound arrestin and its 
comparison with inactive and ‘pre-activated’ arrestin. a, b, The charge 
potential surface map of rhodopsin from the rhodopsin-arrestin bound 
complex shows that the cytoplasmic rhodopsin TM bundle surface is positively 
charged (blue) whereas its C-terminal tail is negatively charged (red). 

c, d, Charged surface of arrestin from the rhodopsin-arrestin bound complex 
shows that the arrestin finger loop is negatively charged (red) and its 
N-terminal B-strand interface is positively charged (blue). The charge 
distribution in rhodopsin and arrestin is complementary to each other for their 
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interactions. e, Comparison of rhodopsin-bound arrestin (light blue) with 
inactive arrestin (brown, PDB code: 1CF1), showing an ~20° rotation between 
the N- and C- domains of arrestin. f, Comparison of rhodopsin-bound arrestin 
(dark brown) with pre-activated arrestin (light brown, PDB code 4J2Q), 
showing conformational changes in the finger loop, which adopts an «helical 
conformation (cyan) in the complex. The extended finger loop conformation 
would protrude into the rhodopsin TM bundle and is not compatible with 
receptor binding. Computational model for the full rhodopsin-arrestin 
complex is shown in panels b and d. 
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Extended Data Figure 11 | A computational model of phosphorylated 
rhodopsin in complex with arrestin and salt sensitivity of the rhodopsin- 
arrestin interaction. a-d, An overall view (a) and close-up views (b-d) of the 
computational model of the rhodopsin C-tail with phospho-serine at positions 
334, 338 and 343 in complex with arrestin. e, The AlphaScreen control 
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(biotin—Hiss) shows much less salt sensitivity than the interaction between 
His-tag—rhodopsin and biotin arrestin, which is very sensitive to salt, with an 
ICs of around 200 mM NaCl (100 mM NaCl added to 100 mM salt of the 
original assay buffer) (n = 3, error bars, s.d.). 
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Extended Data Figure 12 | A positive charge property is commonly found — 5HT»,, PDB code 41B4 (e). Positive and negative charge potentials are shown in 


at the cytoplasmic side of GPCRs. a-e, Surface charge potential of the blue and red, respectively. f, Sequence alignment of the finger loop region 
cytoplasmic side of selected agonist bound GPCR structures: 8B, AR, PDB code _ highlighting negatively charged residues (shown in red), which are conserved 
2Y02 (a); B2xAR, PDB code 3PDS (b); Aza adenosine receptor, PDB code in all subtypes of arrestins. 


3QAK (c); serotonin receptor 5HT},, PDB code 4IAR (d); serotonin receptor 
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Extended Data Figure 13 | A possible role of the arrestin C-edge in lipid membrane layer, where the conserved hydrophobic side chains are shown. The 
binding. a, b, The asymmetric assembly of the rhodopsin-arrestin complex in _ figure was made using the computational model for the full rhodopsin-arrestin 
the presence of a lipid membrane bilayer, showing the C-edge of arrestin complex. 

dipping into the lipid layer. c, d, A close-up view of the C-edge of arrestin in the 


©2015 Macmillan Publishers Limited. All rights reserved 


od ese 


doi:10.1038/nature14619 


Magnetospherically driven optical and radio aurorae 
at the end of the stellar main sequence 


G. Hallinan’, S. P. Littlefair?, G. Cotter®, S. Bourke’, L. K. Harding’, J. S. Pineda!, R. P. Butler’, A. Golden’, G. Basri’, J. G. Doyle’, 
M. M. Kao!, S. V. Berdyugina’, A. Kuznetsov", M. P. Rupen"" & A. Antonova!? 


Aurorae are detected from all the magnetized planets in our Solar 
System, including Earth’. They are powered by magnetospheric 
current systems that lead to the precipitation of energetic electrons 
into the high-latitude regions of the upper atmosphere. In the case 
of the gas-giant planets, these aurorae include highly polarized 
radio emission at kilohertz and megahertz frequencies produced 
by the precipitating electrons’, as well as continuum and line emis- 
sion in the infrared, optical, ultraviolet and X-ray parts of the 
spectrum, associated with the collisional excitation and heating 
of the hydrogen-dominated atmosphere’. Here we report sim- 
ultaneous radio and optical spectroscopic observations of an object 
at the end of the stellar main sequence, located right at the bound- 
ary between stars and brown dwarfs, from which we have detected 
radio and optical auroral emissions both powered by magneto- 
spheric currents. Whereas the magnetic activity of stars like our 
Sun is powered by processes that occur in their lower atmospheres, 
these aurorae are powered by processes originating much further 
out in the magnetosphere of the dwarf star that couple energy into 
the lower atmosphere. The dissipated power is at least four orders 
of magnitude larger than what is produced in the Jovian magneto- 
sphere, revealing aurorae to be a potentially ubiquitous signature 
of large-scale magnetospheres that can scale to luminosities far 
greater than those observed in our Solar System. These magneto- 
spheric current systems may also play a part in powering some of 
the weather phenomena reported on brown dwarfs. 

LSR J1835 + 3259 is a dwarf star of spectral type M8.5 with a bolo- 
metric luminosity 10 3 times that of the Sun, located at a distance of 
5.67 + 0.02 pc (ref. 4). It is positioned close to a transition in magnetic 
activity near the end of the main sequence on the Hertzsprung—Russell 
diagram, where the amount of energy produced in X-rays (indicative 
of the presence of a magnetically heated corona) drops by two orders of 
magnitude over a small range in spectral type’. Simultaneously, rapid 
rotation becomes ubiquitous, indicating a dearth of stellar-wind- 
assisted magnetic braking®. Together, these results suggest that the 
coolest stars and brown dwarfs possess a comparatively cool and neut- 
ral outer atmosphere relative to higher-mass (earlier spectral type) 
dwarf stars like our Sun. Consistent with this picture, LSR 
J1835 + 3259 is a rapid rotator with a period of rotation of just 2.84 
h, and previous deep observations by the Chandra X-ray Observatory 
have failed to detect any X-ray emission associated with the presence of 
a magnetically heated corona’. 

LSR J1835 + 3259 has previously been observed to produce highly 
circularly polarized radio emission, periodically pulsed on a rotation 
period of 2.84 h (ref. 8). Since their initial detection as a new population 
of radio sources’, similar behaviour has been observed for a number of 
very-low-mass stars and brown dwarfs spanning the spectral range M8 
to T6.5'°"*. In some cases, periodic variability has also been detected in 


broadband optical photometric bands and the Ha line'*"*. Together, 
these characteristics are unlike anything observed for higher-mass 
main-sequence stars’”. 

We pursued spectroscopic data in radio and optical bands to invest- 
igate a possible relationship between the periodic radio, broadband 
optical and Balmer line emission. We used the Karl G. Jansky Very 
Large Array (VLA) radio telescope to produce a dynamic spectrum of 
the periodic radio emission from LSR J1835 + 3259. Simultaneously 
we conducted time-resolved optical spectrophotometry using the 
Double Spectrograph (DBSP) on the 5.1-m Hale telescope at the 
Palomar Observatory. Follow-up observations, involving an additional 
7h of more sensitive time-resolved optical spectrophotometry, were 
carried out using the Low Resolution Imaging Spectrometer (LRIS) on 
the 10-m Keck telescope. 

The broadband dynamic radio spectrum produced with the VLA 
reveals a number of distinct components periodically repeating with a 
2.84-h rotation period (Fig. 1). The observed periodic features some- 
times exhibit a cut-off in frequency, are 100% circularly polarized and 
are of very short duration relative to the rotation period, the latter 
implying sharp beaming. These properties are consistent with electron 
cyclotron maser emission produced near the electron cyclotron fre- 
quency at the source of the radio emission (Vy, © 2.8  Beausss Where 
v is frequency and B is magnetic field strength), a coherent emission 
process responsible for planetary auroral radio emission’. From the 
dynamic spectrum of the radio emission from LSR J1835 + 3259, we 
can infer magnetic field strengths in the source region of the emission 
of 1,550-2,850 G, close to the maximum photospheric magnetic field 
strengths found in late-spectral-type M7-M9 dwarfs'*. The proximity 
of the radio source close to the photosphere, together with the persist- 
ent nature of the periodic radio emission, requires a current system 
driving a continuously propagating electron beam in the lower atmo- 
sphere of the dwarf. 

The simultaneous optical spectroscopic data collected with the Hale 
telescope are also modulated on the 2.84-h rotational period (Fig. 1). 
This behaviour is confirmed with the follow-up Keck data, which has a 
higher signal-to-noise ratio, and for which the same periodic modu- 
lation is observed at the same amplitude. This modulation is clearly 
present in both spectral line emission, including the Balmer lines, and 
the broadband continuum optical emission of the dwarf. Most notably, 
the Balmer line emission and the nearby continuum vary in phase 
(Fig. 2), indicating a co-located region of origin. 

We find that the surface feature responsible for the periodic vari- 
ability in the optical spectrum can be modelled as a single component 
approximated as a blackbody of temperature T ~ 2,200 K, with surface 
coverage of <1% (Fig. 2). We attribute this blackbody-like spectrum to 
an optically thick region with the dominant opacity contributed by the 
negative hydrogen ion (H_). H” is the dominant source of solar 
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a : ; Figure 1 | Simultaneous optical and radio 
| ; | periodic variability of LSR J1835 + 3259. 
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optical continuum opacity, but is superseded by the opacity due to the 
bound-bound transitions of molecular lines, such as TiO, for cool 
M dwarfs, owing to the scarcity of free electrons available to form 
the H” ion”. In the solar case, ionized metals fulfil the role of the 
electron donors necessary to sustain a H” population. Since there are 
essentially no electron donors in a thermal gas at T = 2,200 K, another 
population of electron donors is required. 

The periodically variable radio, Balmer line and optical continuum 
emission detected from LSR J1835 + 3259 can be explained by a single 
phenomenon, specifically a propagating electron beam striking the 
atmosphere, powered by auroral currents. Integrating over time and 
frequency, we can determine that the highly circularly polarized radio 
emission from LSR J1835 + 3259 contributes at least 10'° W of power, 
which requires 10'’-10'" W of power available in the electron beam 


for dissipation in the atmosphere, assuming an efficiency of 
107-10 * for the radio emission”. We note that this amounts to 
~10 °-10 * of the bolometric luminosity of the dwarf. Collisional 
excitation of the neutral hydrogen atmosphere by the precipitating 
electrons leads to subsequent radiative de-excitation, resulting in 
Balmer line emission with an average power of 2.5 X 10’” W, consist- 
ent with the energy budget of the precipitating electron beam. This is 
similar to the main Jovian auroral oval, where radio emission from 
electron beams contributes only 10° * of the auroral power, with the 
bulk of the power produced in the infrared (H**; thermal), far ultra- 
violet (Lyman and Werner band H, emission) and optical (Balmer 
line) owing to dissipation of the electron beam energy in the atmo- 
sphere’. Similar ratios are also observed in Io’s magnetic footprint in 
the Jovian atmosphere’®. 


oe b Figure 2 | Modelling the optical variability of 
; 1760,  * ‘Ha (6,560-6,570) A LSR J1835 + 3259. a, The models we adopted for 
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as expected for our model. c, The amplitude of 
variability as a function of wavelength (black) with 
20 uncertainties (shaded grey region), derived from 
the ‘low and ‘high’ states for the auroral emission, 
defined as the red- and blue-shaded regions 
respectively in b. The variability predicted by the 
model presented in a is shown in red (see Methods 
for details). We note that the line emission is not 
represented in the model. 
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We propose that the same electron beam is also causally responsible 
for the co-located optical broadband variability. In this model, the 
associated increased electron number density contributes excess free 
electrons, leading to an increase inthe H population, in a process that 
has previously been invoked for white-light solar flares’. This results 
in an optically thick layer at a higher altitude, and thus lower temper- 
ature, than the photosphere. Despite the lower temperature, the 
absence of deep absorption bands in the spectrum results in a bright 
feature in optical bands, responsible for the broadband optical vari- 
ability. However, in regions of the dwarf spectrum devoid of absorp- 
tion bands, particularly towards the redder end of the spectrum, there 
is a reversal, with the auroral feature appearing dimmer than the 
surrounding photosphere. This results in some optical wavelengths 
displaying lightcurves that are in anti-phase to other wavelengths 
(Fig. 2), as previously seen in the multi-band photometry of an 
M9 dwarf’. 

Our observations point to a unified model involving global auroral 
current systems to explain the periodic radio, broadband optical and 
Balmer line emission detected from LSR J1835 + 3259, as well as from 
other low-mass stars and brown dwarfs. Extending to cooler objects, it 
is notable that radio emission has now been detected from a brown 
dwarf of spectral class T6.5 (~900 K)"’. That brown dwarf is also 
notable for hosting weak Ha emission, one of only three such T dwarfs 
confirmed to emit Balmer line emission’. The similarities with LSR 
J1835 + 3259 suggest that the auroral mechanism operates robustly 
well into the regime occupied by the coolest brown dwarfs of spectral 
types late L and T. 

It is also notable that a large degree of variability has been observed 
in the infrared in this spectral regime, particularly near the transition 
between the L and T spectral classes’. This variability explicitly 
requires variation in temperature or photospheric opacity across the 
surface of these brown dwarfs”, which has been attributed to the 
spatially inhomogeneous distribution of condensate clouds in their 
atmospheres, effectively a manifestation of weather. This is supported 
by the mapping of such cloud patterns on the surface of the recently 
discovered Luhman 16B™. We speculate that the magnetospheric cur- 
rents powering aurorae in brown dwarfs may also drive some of the 
more extreme examples of the weather phenomenon in brown dwarfs, 
possibly via downward propagating electron beams that modify atmo- 
spheric temperature and opacity in the same fashion as has been 
shown for LSR J1835 + 3259. 

The nature of the electrodynamic engine powering brown-dwarf 
aurorae is yet to be determined. For Solar System planets, this electro- 
dynamic engine can be (1) magnetic reconnection between the plan- 
etary magnetic field and the magnetic field carried by the solar wind 
(for example, Earth and Saturn)”», (2) the departure from co-rotation 
with a plasma sheet residing in the planetary magnetosphere (for 
example, Jovian main auroral oval)**”’, or (3) interaction between 
the planetary magnetic field and orbiting moons (for example, the 
Jupiter—Io current system)**. Of these, the sub-corotation of magneto- 
spheric plasma on closed field lines, in turn powering magnetosphere- 
ionosphere coupling currents, has been put forward as a plausible 
model that can be extrapolated from the Jovian case to the brown- 
dwarf regime”**. This model requires a continuously replenished 
body of plasma within the magnetosphere. This mass-loading can be 
achieved in multiple ways, including interaction with the interstellar 
medium, the sputtering of the dwarf atmosphere by auroral currents, a 
volcanically active orbiting planet or magnetic reconnection at the 
photosphere. Alternatively, considering the case of an orbiting plan- 
etary body embedded within the magnetosphere of LSR J1835 + 3259, 
a simple scaling from the Jupiter-Io system indicates that an Earth- 
sized planet (magnetized or unmagnetized) orbiting within 20 radii 
(<30 h orbital period) of LSR J1835 + 3259 will generate a current 
sufficient to power the observed aurorae. However, we note that the 
observed rotational modulation of the radio emission would require 
a substantially asymmetric magnetic field configuration for LSR 
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J1835 + 3259. Indeed, the aurorae would display modulation on both 
the rotational and orbital period, which may be consistent with the 
large degree of variability reported for the radio pulsed emission from 
these objects. 

A possible method of resolving the nature of the electrodynamic 
engine lies with the strong rotational modulation of the Balmer line 
emission of LSRJ1835 + 3259, which implies that the auroral feature is 
not axisymmetric relative to the rotation axis of the dwarf. This should 
result in a variation in the width, intensity and velocity structure of 
line profiles with rotation that can be used to help map the aurora (for 
example, auroral oval versus polar cap), analogous to Doppler 
imaging, which in turn will provide information about the location 
of the electrodynamic engine. 

Our results imply that the available power for generating aurorae on 
brown dwarfs is dependent on magnetic dipole moment and rotation, 
and may be weakly coupled to other physical characteristics, such as 
bolometric luminosity. This accounts for the continuous presence of 
auroral radio emission at similar luminosity from spectral type M8 
through to T6.5, despite a decrease of two orders of magnitude in 
bolometric luminosity over the same spectral range®. This suggests 
that aurorae may be present at detectable levels on even the faintest 
T and Y dwarfs and bodes well for searches for similar emission from 
exoplanets. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 

Radio data reduction. Data were reduced using the Common Astronomy 
Software Applications (CASA Release 4.1.0; http://casa.nrao.edu/) and 
Astronomical Image Processing System (AIPS;_ http://www.aips.nrao.edu/ 
index.shtml) packages. The amplitude and phase of the data were calibrated using 
short observations of the quasars QSO J1850 + 284 and 3C286 that were inter- 
spersed throughout the 6-h observation of LSR J1835 + 3259. Bad data, particu- 
larly those contaminated by radio-frequency interference, were flagged. The tasks 
fixvis/UVFIX were used to shift the source to the phase centre and the tasks clean/ 
IMAGR were used to image the data. The tasks uvsub/UVSUB were used to 
subtract the source models for nearby background sources from the visibility data. 
The real part of the complex visibilities as a function of time and frequency for each 
polarization were then exported from CASA and AIPS and plotted to produce the 
dynamic spectrum of LSR J1835 + 3259 shown in Fig. 1. 

We observed LSRJ1835 + 3259 using 2 X 1-GHz sub-bands spanning frequen- 
cies of 4.3-5.3 GHz and 6.9-7.9 GHz. Two full rotation periods were captured 
during the 5.7-h observation. We show in Fig. 1 the dynamic spectrum detected by 
the right circularly polarized feeds of the VLA antennas. The original data has a 
time and frequency resolution of 1 s and 2 MHz respectively, but is binned and 
smoothed to produce the data shown in Fig. 1, with the y axis truncated to remove 
the large gap between observing bands. Periodic features of 100% circularly polar- 
ized radio emission occupy ~3% of the dynamic spectrum, allowing us to infer 
that it is beamed from the source in an angular emission pattern that occupies a 
similar fraction of a 4m steradian sphere. Studies of electron cyclotron maser 
emission from planetary magnetospheres have revealed the emission to be beamed 
in a hollow cone with walls a few degrees thick and a large opening angle (typically 
~70°-90°) relative to the local magnetic field’. This is consistent with our data and 
accounts for the offset in phase of the radio features relative to the optical peak. 
Integrating over time and frequency and inferred beaming pattern, we determine 
the auroral power contributed by the polarized radio emission to be ~10'° W. 

We detect at least six distinct components in the dynamic spectrum for LSR 
J1835 + 3259, each of which is probably powered by a distinct local-field-aligned 
current. The relationship between these individual current systems within the 
large-scale magnetosphere will probably remain uncertain until the nature of 
the electrodynamic engine is established. Although a number of these components 
exhibit a cut-off in emission rising to higher frequencies, there are still components 
present all the way to the top of the band, implying that the true cut-off in emission, 
associated with the largest-strength magnetic fields near the surface of the dwarf, 
was not captured. 

Hale optical data reduction. The data from the DBSP on the Hale telescope were 
reduced using standard techniques with the aid of the IRAF software suite. First, 
bias level was subtracted from the raw frames. Then pixel-to-pixel gain variations 
in the charge-coupled device (CCD) camera were corrected by normalizing against 
exposures taken with the DBSP internal broadband lamp and the illumination 
function of the long slit was corrected by normalizing against twilight sky expo- 
sures. Next, the {x, y} pixels of the CCD were transformed to a rectilinear solution, 
{wavelength, sky position}, using the DBSP internal arc lamps. Finally, the night 
sky emission was subtracted by fitting a fourth-order polynomial to each column 
along the sky direction, with the stars in the frame masked out, and cosmic rays 
rejected via sigma-clipping. A tramline extraction was then used to make one- 
dimensional spectra of the target and reference stars. 

Keck optical data reduction. Data were reduced using the LOW-REDUX pipe- 
line (http://www.ucolick.org/~xavier/LowRedux). Bias frames were constructed 
by median stacking five individual bias frames. Non-uniform pixel response was 
removed by dividing by a dome flat field produced by stacking seven individual flat 
fields together. Individual objects were located on the slit, and an optimal extrac- 
tion routine*! was used to extract the object spectra on each frame. Wavelength 
calibration was performed using fits to arc line spectra taken at the start of the 
night, which gave a dispersion of 2.4 A per pixel (red) and 3.2 A per pixel (blue), 
and root-mean-square values of 0.7 pixels (red) and 0.4 pixels (blue). Each object 
spectrum was corrected for flexure using fits to night sky lines. Flexure corrections 
ranged from —2 to +2 pixels. Flux calibration was performed via a high-order 
polynomial fit to the flux standard Feige-110, with the Balmer lines masked out. 

The data were corrected for light falling outside of the spectrograph slit using the 
additional comparison stars observed. Since the wavelength coverage of each 
object differs slightly owing to the location of slits in the object mask, slit loss 
corrections were determined as follows. Each comparison star was divided by an 
average of all the frames, to remove the spectral shape. The resulting spectrum was 
fitted with a first-order polynomial to give a series of wavelength-dependent slit 
loss corrections for each frame. This polynomial was then re-binned onto the same 
wavelength scale as the target spectrum. Not all comparison star spectra were used 
to correct for slit losses in the target spectrum. Instead, a master slit loss correction 
was produced via a straight mean of slit loss corrections for selected comparison 


stars, with the quality of slit loss correction being judged by eye. Comparison stars 
were removed from the slit loss correction calculations because they were either 
extremely blue, or not well aligned on their slits in the slit mask. LSR J1835 + 3259 
was slit-loss-corrected using the two reddest comparison stars. 

Wavelength dependence of optical variability. The amplitude of variability as a 
function of wavelength (which we term the difference spectrum) was estimated as 
follows. A rotational phase was assigned to each spectrum using the ephemeris of 
LSR J1835 + 3259. We created a spectrum representing the ‘high state’ and the 
‘low state’ of LSR J1835 + 3259 by averaging all spectra with phases between 
0.95-1.00 and 0.00-0.35 for the high state and 0.45-0.85 for the low state. The 
difference spectrum is then simply the difference between the high-state and the 
low-state spectra. 

Statistical uncertainties on the difference spectrum are negligible compared to 

systematic errors, which arise from imperfect correction of slit losses, sky subtrac- 
tion and removal of telluric absorption. To estimate these systematic errors, we 
produced a difference spectrum using an independent method. In this method, 
lightcurves were produced from a series of 5-A bins, and a sinusoid of fixed phase 
and period was fitted to the lightcurves. A difference spectrum was produced using 
the amplitude of the sinusoid fitted at each wavelength. The two methods yield 
very similar spectra, except in the range between 7,600 Aand 7,650 A, where the 
spectra are affected by telluric absorption. Subtracting the two difference spectra 
gave an estimate of the uncertainty, which is shown in Fig. 2. 
Modelling the variability. We construct a two-phase model of the optical emis- 
sion from LSR J1835 + 3259, with emission from a ‘pristine’ photosphere P, and 
emission from a surface feature S. We assume that the relative contribution from 
these two phases varies as the dwarf rotates. If the surface feature covers a fraction 
fn during the high state and a fraction f, during the low state, then the difference 
between high and low states can be written as: 


A=f,S+U—fr)P—fiS—(1—fi)P 


which can be simplified to: 


A=(fn —fi(S—P) 


We model the emission from the photosphere, P, of LSR J1835 + 3259 using an 
M8 template from a SDSS library of composite M-dwarf spectra*’. To ensure the 
absolute surface flux of the template is correct, we scale the template so that the 
bolometric flux matches that ofa DUSTY model atmosphere” with surface gravity 
of log(g) = 5.0 and an effective temperature 2,600 K. We model the emission from 
the surface feature, S, as a blackbody with temperature T,. Our models for S and P 
give the flux that is crossing a unit surface of the star, so to match them to our data 
they need to be multiplied by a factor N = (R/d)’, where R is the radius of LSR 
J1835 + 3259 and d is the distance to LSR J1835 + 3259. Since this is a simple 
multiplication of the model, this factor can be combined with the parameter ¢. The 
two free parameters of our model are therefore T;, and the scaling constant «. 

We draw samples from the posterior distributions of our parameters by a 
Markov-chain Monte Carlo (MCMC) procedure. Because our difference spectrum 
has unknown uncertainty, a nuisance parameter o is added. The uncertainty on 
the difference spectrum is set to o everywhere, except at wavelengths correspond- 
ing to emission lines, where the uncertainty is set to an arbitrary large value. 

Posterior probability distributions of Ty, ¢ and o are estimated using an affine- 
invariant ensemble sampler”. Uninformative priors were used for all parameters, 
with the exception that ¢ was forced to be positive. The MCMC chains consist of a 
total of 48,000 steps of which 24,000 were discarded as burn-in, giving 560, 770 and 
860 independent samples of T,, ¢ and o, respectively. The posterior probability 
distributions of our parameters are shown in Extended Data Fig. 1. The 7 of the 
most probable model was 860, with 920 degrees of freedom, showing that the 
model is an excellent fit to the data. The only wavelength regions where the model 
fails to reproduce our data are in the emission lines. The emission lines are 
probably caused by collisional excitation of the neutral atmosphere by the precip- 
itating electrons, leading to subsequent radiative de-excitation; a process not cap- 
tured by our simple two-phase model. 

The best-fitting parameters are T,, = 2,180 + 10 K, ¢ = (1.64 + 0.02) x 1077! 
and o = 0.018 + 0.0004 mJy. Using a model-based estimate for the radius and the 
measured distance for LSR J1835 + 3259 to correct ¢ for the (R/d)’ factor, we find 
that the difference in covering fractions between the high state and low state is 
between 0.5% and 1%. These error bars do not take into account systematic 
uncertainties. For example, the photospheric temperature of LSR J1835 + 3259 
is not determined to 10 K; adopting a different template, of M9 spectral type, for 
the pristine photosphere can alter T,, by approximately 50 K. Similarly, systematic 
errors in the scaling factor ¢ are probably around five times higher than the 
statistical errors quoted above. 

Modelling the high state. By fitting the difference spectrum we are able to con- 
strain the auroral surface feature’s spectrum with a minimum of assumptions. 
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Nevertheless, to give confidence in our modelling, one might wish to compare our 
observed high-state spectrum with the predictions of our model. This requires a 
few additional assumptions. The high state can be written as: 


H=N({f,S+(1—fn)P| 


Assuming the same pristine photosphere spectrum P and surface feature spectrum 
S that gave the best fit to our difference spectrum, we use an identical MCMC 
procedure to that outlined above (including a similar nuisance parameter o for the 
uncertainties) to draw posterior samples of N and fy,. We find f, = 0.024 + 0.004, 
N= (8.47 + 0.02) X 10°° and o = 0.196 + 0.003 mJy. The resulting fit to the high- 
state spectrum and residuals are shown in Extended Data Fig. 2. The constraints on 
N above, and the constraint on ¢ from fitting the difference spectrum allow us to 
estimate fy, — fi = 0.0194 + 0.0002 and hence fi = 0.005 + 0.004. 

A couple of pertinent features are visible in Extended Data Fig. 2. The first is that 
the quality of our model is limited at blue wavelengths by the signal-to-noise ratio 
in the M8 template spectrum that we have adopted for the pristine photosphere. 
Although our nuisance parameters can account for this to some degree, this is 
another reason why the statistical errors quoted on parameters are probably 
underestimates. The second is that there are features in the residuals of similar 
amplitude to features in the difference spectrum. These features arise because the 
M8 template is not a perfect fit to the pristine photosphere. However, this does not 
mean that our fit to the difference spectrum is unreliable. If we label our adopted 
photosphere template P, and the true photosphere P’, then the error in the high- 
state spectrum will be: 


H—H'=N(1—f,)(P—P’?) 
whereas the error in the difference spectrum will be: 
4—-A’=NGfx—fi\(P—P) 


Thus, residuals in the high-state spectrum will also be present in the difference 
spectrum, but reduced in size by a factor of more than 50; this implies that the 
residuals will be smaller than the value we adopt for our nuisance parameter when 
fitting the difference spectrum. 

An orbiting exoplanet as an electrodynamic engine. An orbiting planetary body 
embedded within the magnetosphere of LSRJ1835 + 3259 will have motion relative 
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to this magnetosphere and any associated frozen-in plasma. If the planet is con- 
ducting, this motion leads to the generation of an electric field across the planet that 
can power auroral emissions on LSRJ1835 + 32597. The expected power produced 
is proportional to the intercepted flux of magnetic energy, P x vB, Rope» Where B 
is the component of the magnetic field perpendicular to the planet’s orbital motion, 
vis the planet’s velocity relative to the local magnetic field and Ros is the size of the 
obstacle created by the planet, the latter defined by its ionosphere or magnetosphere 
depending on whether the planet is magnetized or unmagnetized’’. For example, a 
simple scaling from the Jupiter-Io system indicates that an unmagnetized Earth- 
sized planet orbiting within 20 radii (<30-h orbital period) of LSR J1835 + 3259 
will generate a current sufficient to power the observed aurorae. 

However, the resulting auroral emission is expected to be strongly modulated by 
the orbital period of the planet, whereas in the case of LSR J1835 + 3259, the 
observed periodicity of 2.84 h is consistent with rotation of the dwarf, as inferred 
from rotational broadening of spectral lines. Indeed, a planet orbiting with this 
period would be within the Roche limit of LSR J1835 + 3259 and would be torn 
apart by tidal forces. An alternative possibility arises if the magnetic field at the 
location of the planet varies substantially as LSR J1835 + 3259 rotates, which will 
be the case if LSR J1835 + 3259 possesses a non-axisymmetric magnetic field. For 
example, a tilted dipole would result in as much as fourfold variation in the current 
produced across the orbiting planet during each rotation of LSR J1835 + 3259. 
The period of the auroral emission would then be Tay, = Tyot(1 + Tyrot/Torp) and 
would approach the rotation period for large values of To:p/T;ot- In this scenario, 
the aurorae would display modulation close to the rotation period as well as on the 
orbital period. 

Code availability. The code used to model the auroral feature of LSR 
J1835 + 3259 is publicly available at https://github.com/StuartLittlefair/Isr1835. 
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Extended Data Figure 1 | Posterior probability distributions for two-phase model parameters. Greyscales with contours show our estimates of the joint 
posterior probability distributions for all combinations of parameters, while marginal posterior distributions are shown as histograms. 
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Extended Data Figure 2 | The high state spectrum of LSR J1835 + 3259. a, The high state spectrum of LSRJ1835 + 3259 (black) is shown along with the model 
that best fits the high state spectrum (red). b, The residuals between the model and the fit. 
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Real-time observation of interfering crystal 
electrons in high-harmonic generation 


M. Hohenleutner!*, F. Langer'*, O. Schubert", M. Knorr!, U. Huttner”*, S. W. Koch?, M. Kira? & R. Huber! 


Acceleration and collision of particles has been a key strategy for 
exploring the texture of matter. Strong light waves can control and 
recollide electronic wavepackets, generating high-harmonic radi- 
ation that encodes the structure and dynamics of atoms and mole- 
cules and lays the foundations of attosecond science’*. The recent 
discovery of high-harmonic generation in bulk solids** combines 
the idea of ultrafast acceleration with complex condensed matter 
systems, and provides hope for compact solid-state attosecond 
sources®* and electronics at optical frequencies***’®. Yet the 
underlying quantum motion has not so far been observable in real 
time. Here we study high-harmonic generation in a bulk solid 
directly in the time domain, and reveal a new kind of strong-field 
excitation in the crystal. Unlike established atomic sources’*?", 
our solid emits high-harmonic radiation as a sequence of subcycle 
bursts that coincide temporally with the field crests of one polarity 
of the driving terahertz waveform. We show that these features are 
characteristic of a non-perturbative quantum interference process 
that involves electrons from multiple valence bands. These results 
identify key mechanisms for future solid-state attosecond sources 
and next-generation light-wave electronics. The new quantum 
interference process justifies the hope for all-optical band-struc- 
ture reconstruction and lays the foundation for possible quantum 
logic operations at optical clock rates. 

Ultrafast time resolution in the few-femtosecond or attosecond 
regime has provided systematic insight into quantum control of indi- 
vidual atoms’’, molecules’’, and solids'*. A spectacular example has 
been to utilize the carrier wave of strong light pulses to control subcycle 
electron motion in atoms and molecules and follow the wavepacket 
dynamics directly via the temporal structure of high-order harmonic 
(HH) emission’*"""*. Quantum theories’® suggest, for example, that 
maximum HH emission occurs at a distinct delay after the crest of the 
driving field, reflecting the time needed to accelerate electrons in the 
continuum’. Subcycle resolution has also been used to unravel novel 
interference phenomena in molecules'*!*”. 

In comparison, subcycle control of electrons in solids is still in its 
infancy, despite its promise of novel quantum physics*”"°*?*? and 
applications in all-optical band-structure reconstruction’’”’, light- 
wave-driven electronics***’°™* or attosecond science**. Only recently 
high-harmonic generation (HHG) has been extended to bulk solids, 
setting bandwidth records in the terahertz-to-ultraviolet spectral win- 
dow*». An intriguing interplay of coherent interband polarization and 
intraband electron acceleration in the regime of dynamical Bloch oscil- 
lations has been suggested to underlie HHG in bulk crystals*’”?; 
such a process can explain, for example, the observed linear scaling 
of the HH cut-off frequency with the driving peak field*°, in contrast 
to a quadratic behaviour found in atoms and molecules. A detailed 
understanding of the microscopic electron motion as well as all envi- 
saged applications depends critically on direct access to the temporal 
structure of HHs from bulk crystals*”**, which has been elusive. 

Here we resolve the temporal fine structure of terahertz-driven 
phase-locked HH pulses from a bulk semiconductor. In addition, we 


directly measure the HH timing with respect to the driving field on the 
same absolute timescale for the first time. Our data reveal that the 
radiation is emitted as a train of almost bandwidth-limited bursts syn- 
chronized with the maxima of the field. Differently from atoms, the 
bursts are emitted only during every second half-cycle. We show that 
these signatures originate from a new type of non-perturbative inter- 
band quantum interference involving electrons below the Fermi energy. 

Multi-octave spanning HH pulses (Extended Data Fig. 1) are gener- 
ated by focusing intense phase-stable multi-terahertz transients centred 
at a frequency of vr, = 33 THz (Fig. 1a, black waveform) into a single 
crystal of the semiconductor gallium selenide (GaSe). To analyse the HH 
pulses with subcycle resolution, we introduce a novel combination of 
cross-correlation frequency-resolved optical gating (KFROG) and elec- 
tro-optic sampling (Fig. 1a). The generated HHs and the terahertz driving 
field (red waveform) are superimposed with a delayed 8-fs near-infrared 
gate (blue waveform) and focused into a 10-m-thick BBO (f-barium 
borate) crystal. Nonlinear frequency mixing simultaneously yields sum- 
frequency signals encoding the temporal structure of HH pulses as well as 
electro-optic traces of the terahertz driving waveform (see “Experimental 
setup’ in Methods). In this way, the relative timing of HH emission with 
respect to the terahertz field is determined with an uncertainty corres- 
ponding to a fraction 7/20 = 1.5 fs of the oscillation period T of the 
driving waveform (see ‘Determination of the absolute timescale’ in 
Methods and Extended Data Fig. 2). 

Figure 1 compares the terahertz pump field (Fig. 1b, black curve) 
with the spectrally integrated (Fig. 1b, shaded curve) and the spectrally 
resolved (Fig. 1c, colour map) sum-frequency signal. A double-blind 
XFROG algorithm (see ‘Double-blind XFROG algorithm’ in Methods) 
allows us to retrieve the actual temporal envelopes and relative phases 
of both the gate and the HH pulses” from the sum-frequency data. The 
consistency of this analysis is confirmed by the excellent agreement 
between the measured and reconstructed two-dimensional spectro- 
grams (Fig. 1c and d) and between the intensity envelope of the gate 
pulse retrieved from the spectrogram and an independent second 
harmonic FROG measurement (Extended Data Fig. 3). The retrieved 
time trace of the HH intensity I(t) contains spectral contributions 
from 50 to 315 THz (Extended Data Fig. 3). Iyy44(t) consists of a train of 
three ultrashort bursts (Fig. le, shaded curve) featuring three remark- 
able properties, as follows. (1) The maxima of Iyy(¢) and Epy,(t) 
coincide within +2 fs = T/15 (vertical dashed lines). This behaviour 
is in contrast to ballistic electron recollision models’” where the max- 
imum of Iyy(t) is distinctly delayed with respect to the maximal driv- 
ing field’. (2) Unlike in atomic HHG, Iiy:(t) is suppressed by one order 
of magnitude for field maxima of negative polarity. (3) The duration of 
the unipolar HH bursts is as short as 7 fs (full-width at half-maximum 
of intensity), which corresponds to a single oscillation period of 
the fourth harmonic order. Such pulse widths are expected only if all 
frequency components within the smooth spectral envelope (Extended 
Data Fig. 3) generated during one half-cycle of the driving field are 
emitted almost simultaneously. This is indeed the case as can be seen in 
Fig. 1c and d, where all sum-frequency components peak roughly at 
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Figure 1 | Subcycle time structure of HH emission from a bulk crystalline 
solid. a, Experimental setup of the novel cross-correlation scheme: a multi- 
terahertz transient (black) is focused onto a bulk GaSe crystal (thickness 60 Jim) 
for HH generation. The resulting waveform (red) is overlapped with a near- 
infrared gating pulse (blue, pulse duration 8 fs, centre wavelength 840 nm) 
using a beam splitter (BS) and focused into a BBO crystal (NX, thickness 10 Jim) 
for simultaneous electro-optic interaction and sum-frequency generation. 
These signals (green) are split with a beamsplitter (BS2) and simultaneously 
recorded with a standard electro-optic sampling (EOS) setup and a 
spectrograph with a cooled silicon CCD detector (SPEC). b, Waveform of the 
multi-terahertz driving field featuring peak amplitudes of 47 MV cm ' anda 
central frequency of 33 THz confirmed by electro-optic detection in a ZnTe 


the same delay time t (vertical broken line in Fig. 1c and d), suggesting, 
at most, a weak spectral chirp of the HH pulses. 

The observed time structure implies a quasi-instantaneous and uni- 
polar generation mechanism. In order to identify this key ingredient, 
we first reproduce Iy4(t) by a full quantum theory*”° (see ‘Quantum 
many-body model’ in Methods) including intra- and interband 
dynamics with two conduction and three valence bands (Extended 
Data Fig. 4). Our calculation reproduces the experimentally observed 
behaviour of Iy74(t) in great detail (Fig. 2a, red solid curve). In particu- 
lar, the emission peaks within 2 fs about the positive field crest while it is 
strongly suppressed for negative field extrema. In contrast, recent mod- 
els accounting for only two electronic bands have consistently predicted 
HH emission in a bipolar fashion’ and have suggested analogies with 
atomic HHG” where mostly two classes of electronic states have been 
considered: the ground state and the continuum of ionized states. In a 
solid, however, the simultaneous interaction of each electron with 
many atoms of the crystal lattice forms a series of electronic bands. 
As soon as more than two bands are included, electrons may be excited 
through multiple paths inducing quantum interference. 

Figure 2b illustrates a minimal model for this scenario accounting 
for two valence bands (h, and h,) and one conduction band (e;). 
Excitation of an electron from band h, to band e; may either proceed 
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crystal (thickness 6.5 j1m, black curve). Signals obtained from sum-frequency 
mixing of HH and gating pulses are shown after integration over a frequency 
window from 490 THz to 523 THz (green curve). Dashed vertical lines 
highlight the local maxima of the terahertz field and error bars indicate the 
standard deviation of the extracted sum-frequency peak position for 12 
separate measurements. c, d, Spectrograms showing the intensity of the 
measured sum-frequency signal for different delay times and frequencies as 
recorded with a Si CCD detector (c) and reconstructed using a double-blind 
XFROG algorithm (d), respectively. White dashed lines highlight the 
maximum sum-frequency intensity. e, Temporal shape of intensity [yyy (red) of 
the reconstructed HH pulse sequence relative to the driving multi-terahertz 
waveform (black). Dashed lines and error bars are the same as in b. 


by multi-photon transitions directly between two bands, h; => e), or 
indirectly via an additional band, h, > h,— e;. The terahertz pulse 
is far off either resonance, but it is sufficiently strong to generate 
non-perturbative excitations where electron populations change 
drastically on a subcycle scale. We show that such non-perturbative 
transitions tend to balance the respective weights of the excitation 
paths because the extremely strong field forces the electrons to 
oscillate between the non-resonantly coupled states (see ‘Inter- 
ference path efficiency’ in Methods). Nonetheless, the excitation paths 
maintain their perturbatively assigned symmetry (see ‘Strong-field 
quantum interference’ in Methods), featuring an odd transition ampli- 
tude A,(—Eryz) = —Ao(Ernz) with respect to the driving field for the 
direct excitation and an even amplitude A,(— Eyy,) = A.(Ery,) for the 
indirect path h, — hy —> e, (Fig. 2b, Extended Data Fig. 5). A coherent 
superposition of both yields a total amplitude of A.(|E;y,|) + (—) 
A.(|Eruc|) for positive (negative) Ery,, respectively (see 
‘Perturbative versus non-perturbative quantum interference’ in 
Methods). Hence, the sign of the field controls the total outcome of 
HH transitions. Note that the transition h; — h, connecting bands 
below the Fermi level is initially Pauli-blocked but strong excitation 
can significantly empty h,, for example, via the transition h,— e;, 
clearing the path h; > h)—> e). 
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Figure 2 | Non-perturbative quantum interference in HH emission. 

a, Driving terahertz field (black curve) and calculated intensity envelope of HH 
emission as a function of time for F., = 0 (broken red curve, magnified by a 
factor of 25) and F.. = 1 (solid red curve). Dashed vertical lines highlight the 
local maxima of the terahertz field. b, Simplified three-band schematic of 
different ionization pathways from valence band h, to conduction band e;. The 
direct transition amplitude A,(Ery,) is an odd function of the driving field, 
Ao(—Ernz) = —Ao(Eruz). The amplitude A.(Erp,) of the indirect path 


We test the viability of this concept by a systematic switch-off ana- 
lysis using our five-band computation that includes all relevant transi- 
tions. By artificially multiplying the dipole moment dj, ,, between the 
hole bands h, and hg as well as all other similar terms with a coherent 
control factor F,., we eliminate the indirect paths needed for non- 
perturbative quantum interference. Figure 2a compares the intensity 
envelope, I(t), with (red solid line, F.. = 1) and without (red dashed 
line, F.. = 0) the indirect paths. Switching off the quantum interfer- 
ence, that is, considering only direct transitions (F., = 0), produces 
bursts at positive and negative crests of the field. Interestingly, the 
bursts become delayed by roughly T/4 with respect to the field extrema, 
which is consistent with a delay expected in an atomic recollision 
model*’”. However, opening the indirect paths (F,, = 1) synchronizes 
the emission with the driving field. More specifically, the interband 
coherence is driven such that the quantum interference and the result- 
ing HH emission are strongest during the presence of the electric field. 
This process maximizes (suppresses) the emission with the positive 
(negative) crests of the field. The destructive interference is not perfect, 
leaving small HH remnants at negative peak fields. 

Figure 2c shows computed normalized Iyy4(t) traces as a function of 
F., (unscaled representation in Extended Data Fig. 6). By gradually 
suppressing the coherent-control paths, emission appears as delayed 
bursts after each field maximum and minimum. Nevertheless, the trans- 
ition is not smooth, but contains non-trivial oscillations and bifurca- 
tions. These features are caused by terahertz-induced band mixing, 
which modulates electronic populations and Iy4(¢) and underpins the 
non-perturbative character of the interband excitations. 

Under the extremely non-resonant conditions of our experiment 
(hvre, < E,/14 with E, = 2.0eV being the bandgap energy of GaSe), 
band-to-band transitions require multi-terahertz pump photons. Non- 
perturbative excitations can non-resonantly drive all these transitions to 
exhibit large population transfer (Extended Data Fig. 5), and the related 
processes are robust against variations of the terahertz field strength and 
photon energy. Therefore, the quantum interference should be detectable 
for a broad range of field amplitudes Ey;;, and terahertz photon energies. 
In fact, the HH maxima remain synchronized with the positive peak of 
the driving field for both experimental (Fig. 3a) and theoretical (Fig. 3b) 
traces of Iy4(t) when the terahertz frequency is changed between 25 and 
34 THz, whereas the temporal separation of the emission bursts grows 
with the oscillation period of the driving field. Both measured (Fig. 3c) 


574 | NATURE | VOL 523 | 30 JULY 2015 


Delay time (fs) s 


(h, > h2 = e;) is the product of two odd functions, resulting in even symmetry 
A.(—Eryz) = Ae(Eruz). The indirect path features a transition (h, — hg) that is 
initially blocked by Pauli exclusion and only opens under strong-field 
excitation. c, HH intensity envelopes computed within the five-band modelas a 
function of delay time and the coherent control factor F.. regulating coherent 
transitions between occupied valence bands. Bright colours mark strong 
emission, dark colours mark weaker emission (see key). All time traces are 
normalized separately. 


and computed (Fig. 3d) traces of I-y4(t) also remain unipolar and quasi- 
instantaneous when the terahertz field amplitude Ey, is changed. The 
contrast is even enhanced for higher field strengths. 

Recent studies have demonstrated that strong terahertz fields, 
needed to create HH emission, can also coherently accelerate electrons 
throughout the Brillouin zone before scattering occurs**?'””. Owing 
to resulting dynamical Bloch oscillations, electrons may undergo one 
or more Bragg reflections within one half-cycle of the driving field, 
emitting high-frequency radiation at the quasi-instantaneous Bloch 
frequency vp. Since vg is proportional to Eyy, (see ref. 28), the fre- 
quency of the Bloch-related contribution to HHG should trace the 
temporal profile of the driving field. Our XFROG algorithm allows 
us to retrieve the temporal phase ¢yy(f) of the HH pulse train 
(Extended Data Fig. 7), from which we obtain its instantaneous fre- 
quency v,(t) = (22) Odyy/Ot, weighted by the spectral amplitude 
within our detection bandwidth (Fig. 3e). All time traces of v;(t) mea- 
sured for different terahertz amplitudes follow a universal double-chirp 
pattern, which is a fingerprint of dynamical Bloch oscillations: after a 
monotonic increase during the rising slope of Ey4,(t), v; peaks approxi- 
mately at the maximum of the applied field and decreases again fol- 
lowing the abating driving field. With increasing amplitude, the 
instantaneous frequency in a single HH pulse blue-shifts globally while 
its maximum broadens, develops shoulders and finally morphs into a 
non-monotonic pattern for the highest field strengths. Our quantum 
theory reproduces even these non-trivial features well (Fig. 3f). 

The combination of non-perturbative quantum interference and 
dynamical Bloch oscillations may be systematically harnessed for ultra- 
short pulse shaping. By varying the terahertz carrier frequency (Fig. 3a, b) 
and the carrier-envelope phase (CEP) of the driving waveform 
(Extended Data Fig. 8), the global shape of the HH pulse sequence can 
be tailored, whereas the frequency modulation within individual bursts is 
reproducibly set by the terahertz amplitude. Almost bandwidth-limited 
pulses may be generated—especially if the phase-flattening effect for high 
peak fields is exploited (Fig. 3e, f). We expect that in our experiment, 
suitable high-pass filtering of the HH pulses may allow for pulse dura- 
tions as short as 3 fs in the infrared and visible domain (Extended Data 
Fig. 9). Since the principle of solid-based HHG is fully scalable to the 
ultraviolet, even shorter pulses may be possible in wide-gap materials. 

In conclusion, the relative timing of HHG with respect to the driving 
field, the unipolar response, and a non-monotonic frequency modulation 
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Figure 3 | Tunability and robustness 


wT 
= a Experiment = 107¢ Experiment S e Experiment} of non-perturbative quantum 
= = = 115 interference. a, b, Measured (a) and 
8 1.0, 5 2 calculated (b) intensity envelopes [yyy 
£& £ os 5 of emitted HH for driving fields 
8 05 8 = 105-5 featuring central frequencies of 25, 30 
ae a 3 and 34 THz, respectively. Darker 
E E 5 colours represent higher central 
Z 0.0 Z 0.0 S gs frequencies. c, d, Measured (c) and 
no 
-40 -20 0 20 40 -50 -25 0 50 =£ 5 oO 5 calculated (d) HH intensity envelopes 
Delay time (fs) Delay time (fs) Delay time (fs) for driving peak fields of 26, 31 and 
15 > 44MVcm (experiment) and 19, 22 
3 a 
i: b Theory = 1.0 d Theory = f Theory and 31MVcm (theory). Brighter 
= = = 100+ colours represent higher peak fields. 
3 ‘@ 3) e, f, Instantaneous frequency y; of the 
co) @ 3. central HH emission bursts shown in 
2 = 0.5 £ 90- cand d, respectively. Brighter colours 
N qg 2 represent higher peak fields. 
o 2 
E E £ 
2 2 0.0 5 80- 
-20 0 20 40 -50 -25 0 = -5 0 5 
Delay time (fs) Delay time (fs) Delay time (fs) 


provide direct insight into the terahertz strong-field-driven motion of 
electrons in GaSe. We identify a non-perturbative quantum interfer- 
ence between interband transitions as a salient HH generation mech- 
anism. In its most generic form, this strong-field mechanism can occur 
if (1) there is a closed-loop triangle system of states that are all mutually 
coupled by dipole transitions (Extended Data Fig. 10), and (2) the 
terahertz field is far below these resonances and (3) strong enough to 
generate non-perturbative excitations changing carrier populations 
substantially (see Methods and Extended Data Fig. 5). In contrast to 
established techniques of perturbative quantum interference between 
one- and two-photon transitions inducing directed charge and spin 
currents”, our new concept is robust even at extremely strong fields. 
Thus, it may inspire new techniques for quantum-logic operations” 
based on sturdy, non-perturbative transitions between strongly coupled 
energy bands. Driving coherences between initially fully occupied val- 
ence bands, HHG provides all-optical access even to details of the band 
structure hidden below the Fermi level. Furthermore, the direct obser- 
vation of lightwave-controlled electron dynamics marks the way 
towards a complete microscopic picture of HHG in solids, ultrafast 
electronics, and novel solid-state CEP-stable attosecond sources. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


Experimental setup. A femtosecond titanium-sapphire laser amplifier (repetition 
rate, 3 kHz; pulse energy, 5.5 mJ; pulse duration, 33 fs; centre wavelength, 805 nm) 
is used to pump two parallel dual-stage optical parametric amplifiers tunable 
between centre wavelengths of 1.1m and 1.8 ,1m which deliver signal pulse 
energies of up to 0.5 mJ each. We generate phase-locked multi-terahertz pulses 
via difference frequency generation between these spectrally detuned near-infra- 
red pulse trains”. 

High-order harmonics (HHs) are obtained by focusing these intense waveforms 
onto a gallium selenide (GaSe) crystal. Using samples 220 tm thick, power con- 
version efficiencies of approximately 7% for the whole HH spectrum covering the 
spectral range from 45 to 675 THz are observed. Resulting pulse energies of 
roughly 350nJ are measured for a peak driving field of 72MVcm ‘ close to 
the observed damage threshold of GaSe. The threshold decreases for thinner 
samples suggesting thermal heating as the relevant damage mechanism. For the 
time-resolved experiments, a 60-j1m-thick free-standing GaSe crystal is irradiated 
(centre frequency, 33 THz; electric peak field, 47 MV cm™ 1. see inset Extended Data 
Fig. 1) under normal incidence to avoid phase matching effects. Under these con- 
ditions, the Bloch period amounts to 5.2 fs, corresponding to an energy of 0.8 eV. 
The polarization of the driving field points along the I-K direction of the hexagonal 
Brillouin zone of GaSe. The resulting spectrum (Extended Data Fig. 1) is recorded 
by means of a monochromator in combination with a calibrated pyroelectric 
detector (PED) and a lead sulfide (PbS) diode, as well as spectrometers with indium 
gallium arsenide (InGaAs) and cooled silicon (Si) detector arrays, respectively. 

A YAG-based supercontinuum source with a combination of chirped mirrors 
and a prism compressor provides 8-fs gating pulses with a centre wavelength of 
840 nm (Extended Data Fig. 3). Both the gating and the HH pulses are focused 
onto a 10-jtm-thick B-BaB2,O, (BBO) crystal, which mediates two distinct non- 
linear optical processes simultaneously, as follows. (1) The HHs are mixed with the 
gate by ultrabroadband sum-frequency generation. This cross-correlation signal is 
recorded as a function of the delay time t between the HH and gate pulses and 
encodes the temporal structure of both. (2) The terahertz fundamental wave 
induces an electro-optic polarization rotation of the gate pulse. This signature 
Szos-ppo(t) serves as a time marker directly linking the emitted HH intensity 
Tyu(t) with the generating terahertz field Ey14,(t) (see below for details). 
Determination of the absolute timescale. For a complete time-domain picture of 
high-harmonic generation (HHG) in solids, the emitted signal (Ey) has to be 
measured and temporally correlated with the driving field (Ey,,) with a precision 
significantly better than one optical cycle of Ery,. The most accurate way to 
determine the relative timing is to detect the co-propagating terahertz wave and 
the generated HHs simultaneously within the same detector. Additionally, the 
detector response as well as all propagation effects between HH generation and 
detection of the waveforms has to be taken into account. 

In our experiment, we superimpose the HH waveform including the fun- 
damental terahertz field and an ultrashort near-infrared pulse within the same 
10-tm-thick BBO crystal to generate sum-frequency (Sgr) and electro-optic sig- 
nals (Sgos-ppo). This electro-optic trace is employed as a temporal reference 
marker only. The strong material dispersion of BBO” in the multi-terahertz 
spectral region distorts the fundamental waveform. For a faithful determination 
of the temporal profile of the terahertz field, we exchange the BBO crystal with a 
6.5-jum-thick (110)-oriented zinc telluride (ZnTe) sampling crystal glued on a 
300-um-thick electro-optically inactive ZnTe substrate and record the electro- 
optic signal (Sgos-znre) for different delay times (Extended Data Fig. 2). Via an 
analysis of the Gouy phase shift® for several recordings (‘z-scan’), we ensure that 
the ZnTe crystal is placed in exactly the same position as the BBO crystal. 

The accuracy of this correlation procedure is estimated to be 1.2 fs by repeatedly 
exchanging the BBO and ZnTe crystals and calculating the standard deviation of 
the temporal delay between the transients recorded in BBO and ZnTe detectors, 
respectively. The error resulting from a misalignment of the detection crystals is on 
the order of 0.1 fs and may be neglected. 

Finally, we correct for effects of phase-matching and dispersion of the 7? 
nonlinearity using standard procedures™ (compare Extended Data Fig. 2b) to 
obtain the terahertz field Ep. at the detector position for a given signal 
Szos-znte- The thickness of the ZnTe crystal dzwre is confirmed by optical inter- 
ference to be dzpre = 6.5 um + 0.7 um, resulting in an error of the peak field 
positions of 0.9 fs due to the uncertainty in the determination of the crystal thick- 
ness. Consequently, the relative error in the temporal delay between the sum- 
frequency signal (Ssp) generated by the emitted HHs and the terahertz field at 
the detection focus is 1.5 fs, that is, approximately 20 times shorter than one 
oscillation period of the terahertz waveform. 

Propagation effects between HHG and detection of the fields may modify 
the measured signals additionally. We employ only reflective optics between the 
generation and detection stages. Consequently, differences in group velocities are 


negligible in our experiment. The GaSe sample is placed directly in the focus of the 
terahertz wave, leading to an additional Gouy phase” offset of /2. For our focus- 
ing conditions, the Rayleigh length of the driving terahertz beam is about 500 jum, 
which is much longer than the crystal thickness of 60 jum and allows us to neglect 
the uncertainty in the sample position. The Gouy phase shift at the detection focus 
is also fully accounted for. 

During propagation inside the sample, dispersion of the HH group velocity 
would lead to a relative delay of the spectral components corresponding to differ- 
ent harmonic orders. Since we deliberately avoid phase-matching during HHG in 
GaSe, the emission of HHs is confined to a thin region near the back facet of the 
GaSe crystal with a coherence length of, for example, 6 jtm for the 9th harmonic, 
and no substantial chirp due to linear material dispersion is observed in the 
experiment. Note that the rear facet itself does not have an essential role in 
HHG. The HH radiation rather originates from the bulk of the crystal, as seen 
from the fact that phase-matching effects occur in thicker samples’. 
Double-blind XFROG algorithm. For the characterization of the temporal struc- 
ture of terahertz HHs, we perform cross-correlation frequency resolved optical 
gating? (XFROG) between an 8-fs gating pulse and the HH pulse train. Our 
analysis is based on standard XFROG algorithms***’, customized to take into 
account the ultrabroadband nature of the HH spectra. The gating pulse encom- 
passes a spectral bandwidth of more than 100 THz (Extended Data Fig. 3), while 
the emitted HH spectrum continuously covers multiple optical octaves. As a 
consequence, ultrabroadband sum-frequency mixing signals comprise a fre- 
quency range of more than 300 THz (Extended Data Fig. 3), benefitting from 
the huge acceptance bandwidth of the 10-jm-thick BBO crystal. Despite the 
enormous bandwidths, the XFROG algorithm reliably converges for discretionary 
runs and data sets and reconstructs the measured spectrograms to a very high 
degree of congruency (compare Fig. 1). The robustness of the scheme is addition- 
ally confirmed by comparison of the retrieved temporal shape of the gating pulse 
to an independent SHG FROG measurement, reconstructed using a separate 
algorithm** (Extended Data Fig. 3b). 

The robust XFROG reconstruction allows us to retrieve the subcycle profile of 
both the intensity envelope Iyj4(f) and the relative phase dyy4(t) of the HH bursts. 
While the temporal structure of y;4(t) is shown in Fig. le, Extended Data Fig. 7 
depicts the relative phase. The modulations of the phase are relatively small as 
already indicated by the almost simultaneous appearance of all sum-frequency 
spectral contributions (Fig. 1c). To study the phase evolution in greater detail, we 
derive the instantaneous frequency v, = (2m) '0y4/0t by numerical differenti- 
ation. The instantaneous frequency is ramped up during the rising edge of a HH 
burst, peaks together with the intensity envelope and decreases again (Extended 
Data Fig. 7). This slight double-chirp is an indicator of Bloch-type acceleration of 
carriers within the conduction band. Furthermore, the phase retrieval allows for 
the reconstruction of the HH field trace (compare red waveform in Fig. 1a). 
Quantum many-body model. We use the HHG theory developed in refs 20 and 
39 to describe the coherent interplay of interband excitations and intraband cur- 
rents and apply this microscopic model to study the time resolved emission of 
GaSe excited with extremely strong terahertz fields. The numerical calculations are 
based on a one-dimensional five-band model as depicted in Extended Data Fig. 4a, 
including two conduction bands (A = e,, e2, blue lines) and three valence bands 
(A= hy, hg, hg, red lines). The individual bands are modelled by effectively one- 
dimensional tight-binding bands”, with material parameters taken from ref. 40. In 
the following, we will review relevant details, while more comprehensive deriva- 
tions can be found in ref. 5. 

The presence of a sufficiently strong external electric field E(t) induces inter- 
band transitions where electrons are transferred between two different bands 2 
and 4', as schematically depicted by the blue spheres and the black arrow in 
Extended Data Fig. 4b. The dipole-matrix element dj; determines the strength 
of the transition. The inter-valence-band dipole-matrix element d),n, exceeds the 
values of dj,,-,and d),., by one order of magnitude*!. The strongest driving fields 
applied in the calculations correspond to peak Rabi energies of 1.5 eV for inter- 
valence-band transitions and 0.12 eV for valence-to-conduction-band transitions. 
In addition to the interband transitions, E(t) also drives intraband currents where 
electrons and holes are accelerated in their respective bands*$?°?75?7, see 
Extended Data Fig. 4c. 

Defining electron and hole occupations f? and microscopic polarizations pi“ for 
2# 2’, the well-known semiconductor Bloch equations*’ (SBEs) describe the time 
evolution of polarizations and carrier occupations. The SBEs read 
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for the microscopic polarizations between a valence band and a conduction band. 
In order to capture the dominant effects of the higher-order correlations, we 
include a phenomenological dephasing via the decay time T, = 1.1 fs in the polar- 
ization dynamics, following refs 5 and 44. 

Multi-photon coherence is particularly susceptible to dephasing because any 
scattering scrambles its phase relations. In fact, similarly fast decay times of coher- 
ences have been reported in different systems ranging from bandgap dynamics in 
silicon** (0.5-fs decay) via general HHG in solids (ref. 22 (4-fs decay) and ref. 46 
(sub-10-fs decay)) to atomic HHG” (sub-10-fs decay). This ultrafast timescale 
results from electron-electron scattering** as well as polarization-polarization 
scattering’, which is fostered by the broad carrier and polarization distribution 
created in HHG. In addition, the terahertz field excites a large amount of electrons 
much faster than Coulomb screening builds up. The unscreened Coulomb 
interaction enhances scattering rates significantly, as discussed for example, in 
ref. 47. The resulting dynamics are numerically solved for an initially unexcited 
system using an excitation field E(t) modelled closely to the experimental terahertz 
waveform. 

The emission intensity of a coherently excited semiconductor consists of a 
polarization source P(t) and a current source J(£), 


PU)= So dix(Kpy” and I(t) = So jnkyfe (3) 
1k 2k 
with the current matrix element j;(k) = lel el Vet. 


To account for the — of caret in a realistic system, a phenomeno- 


logical carrier relaxation h a < fi ee = 


—- “ff ‘4 is included, which damps the anti- 
» Ats 2 
symmetric part fi, = 5 | kK SC »| of the carrier distributions, producing the 
correct decay for the currents. We apply this relaxation to all valence bands and 
the first conduction band, with the relaxation time t = 7 fs. Owing to its flat shape, 
the second conduction band does not contribute essentially to the currents. 
The emission intensity is defined by the total effective current given by the 
sum of intraband currents J(t) and the rate of change of the macroscopic 


7 
polarization® a P(t), 


a 
Eunc(t) « = PW) +I(0) (4) 


The envelope and phase of the numerically computed time trace is extracted via a 
Hilbert transform’. As experimental evidence from phase matching effects (see 
‘Determination of the absolute timescale’ in Methods) suggests, surface effects do 
not contribute considerably to the HHG and are, thus, neglected in our model. 
Strong-field quantum interference. As described in the previous section, all our 
numerical analysis is performed with the five-band model. However, in order to 
obtain an intuitive understanding of the dominant quantum-interference paths 
contributing to the experimental observations, it is sufficient to consider a sim- 
plified system that includes only the three most relevant bands. As depicted in 
Extended Data Fig. 5a, we use two valence bands h, and h, (red solid lines) and one 
conduction band e (blue solid line). The transition probabilities between the bands 
are given by the magnitude of the dipole-matrix elements dey,, den, and dh,n, 
(black arrows). For an initially unexcited system, only transitions from the two 
valence bands h, and h, to the conduction band e are possible, as both valence 
bands are completely filled, preventing any transition between them. 

In order to study the different excitation paths leading to the effective transition 
h, ~e, the sources 


ih— op = 


ex. 


den, (E()(1 — fg — fy") + dan, (DECOR + deh, (KECK (5) 
based on equation (2) are analysed for the situation of three bands depicted in 
Extended Data Fig. 5a. The gradient term is omitted here, because it only redis- 
tributes the carrier momentum inside a band without inducing transitions 
between different bands. For the sake of a simpler notation, we will also omit 
the explicit k-dependence from now on. 

The first excitation path h, > e (first term in equation (5)) is depicted by 
the blue arrow in Extended Data Fig. 5b. The terahertz driving field creates a 
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polarization between the first valence and the conduction band, which is propor- 
tional to the driving field strength and the corresponding dipole-matrix element. 
We call this the direct excitation path, ph"*| jirec¢ © deh, E(£); it is initially linear in 
the electric field E(t) and thus of odd order with respect to the driving field sign. 
For strong fields, this polarization will eventually excite electrons by multi-photon 
absorption to the conduction band. This change in the carrier occupations given 
by Fé laivect © dem E(t) (Ph | airect)” 43, E(t)” will modulate the initially linear 
dependence via the electron occupations of the involved states. Nevertheless the 
odd symmetry of the transition is preserved: Due to the mutual coupling of polar- 
ization and carrier occupations, that is, Pk “| airect © deh, E(t) — den, E(t) lairect aNd 
FE legivect © dehy E(t) (PR | ainect)” «da, E(t)” the initially linear dependence will be 
replaced by a series of ee which are all of odd order in the driving field, that 
is, Pi "| airect © deh, E(t) — di, E(t)? + 

In the same way, the excitation pak h2— e will also be driven by the electric 
field (three-dimensional red arrow in Extended Data Fig. 5c). Hence, 
p* ocdeh, E(t) initially has a linear dependence on the driving field. Eventually 
electrons from h, will also be excited to the conduction band, which will create 
vacancies in h, and allow for transitions h,; — hz between the valence bands. 

As soon as h, — h, transitions (flat red arrow in Extended Data Fig. 5c) become 
possible, the second excitation path, described by the second term in equation (5), 
Pe | second % Ahsh, E(t) pe?*, opens up. This path leads to a field induced excitation 
from the first valence band via the second valence band to the conduction band 
and is mediated by the polarization peo Thus only if a polarization pee is already 
present and the transition h, — hy is possible, this indirect path exists. However, 
pre itself depends initially linearly on the electric field. Therefore, the indirect 
path pp?* |indirect © hh, den, E(t)” depends quadratically on the electric field and is 
proportional to the product of the transition dipole-matrix elements d),y, for 
h, > h, and d.,, for he. For strong excitations this quadratic dependence is 
replaced by a series of terms which are of even order in the driving field. Overall, 
the transition h, > h,— e has an even symmetry with respect to the driving field. 

Combining direct and indirect paths, the sum of their amplitudes—not their 
probabilities—defines the total transition yield. Therefore, in the lowest order, the 
total polarization 


h h: hye 
Pr Ieotat = Pr lairect + PR lindirect © den, E(t) + dhyh, den, E(t)” (6) 


consists of linear and quadratic contributions. For strong excitations, the field 
dependence is replaced by functions which are of even and odd order in the driving 
field. The resulting quantum interference controls the symmetry of the excitation 
h, — e because direct and indirect paths involve different field orders, as depicted 
by the red and blue arrows in Extended Data Fig. 5d. Therefore, changing the sign 
or the phase of the driving field E(t) allows for direct control of the quantum 
properties of the excited state e. Equation (6) shows that all three transitions 
h,— e, hye, and h, — hy, are needed to produce quantum interference. If one 
transition A— 2’ is forbidden, that is, dj; =0, ripe ee can only have one contri- 
bution with either even or odd parity in E. 

Another contribution to the indirect path h, — h)— e is described by the third 
term in equation (5), Pe bisa =den, (HEH) pe hy Here, the polarization between 
the two valence bands mediates the transition to the conduction band. However, 
pe is not driven by the electric field directly and may thus have a different field 
dependence compared to the pre. mediated transitions, effectively creating an 
additional excitation path. 

In the full five-band system, the additional bands will greatly increase the 

number of possible excitation paths. Even transitions involving three bands, for 
example, hj > h, > h;— e, can become possible. The resulting excitation path 
will also contribute to the transition h, — e analysed above, leading to a quantum 
interference of at least three different excitation paths, with a possibly even more 
complicated field dependence. In the same way, excitations where the second 
valence band is the initial or final state, that is, h, > e and h, > hy, also contribute 
to the emission (direct paths). 
Perturbative versus non-perturbative quantum interference. In principle, 
quantum interference could also be created in the perturbative regime. 
However, the frequency and intensity of the driving field would have to be chosen 
carefully to balance the involved excitation paths that depend on different orders 
of the driving field. A slight change in the properties of the pump field would then 
lead to a preferred path and, as a consequence, destroy the quantum interference 
effect since that preferred path would mainly contribute to the emission. As shown 
in the main text, the unipolar emission as a sign of quantum interference is 
observed for a variety of driving field strengths and central frequencies, which is 
a strong indicator for non-perturbative coherent control. 

Owing to the very strong terahertz fields used in the experiments, the transition 
amplitudes are modulated by the electron occupations f;- of the involved states. 
The resulting massive dynamic occupation changes are beyond the validity of the 
perturbative analysis and the non-perturbative regime is reached. Technically, the 
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power-law dependence of transition amplitudes on the electric field E is replaced 
by some nonlinear functions, denoted by A,(E) and A,(E) for direct excitations 
(for example, h, — e) and indirect paths (for example, h, — h,— e), respectively. 
Nevertheless, as in the perturbative analysis, we can still identify an odd symmetry 
A,(—E) = —A,(E) for the direct path and an even symmetry A.(—E) = A,(E) for 
the indirect path. Also the superposition principle remains valid, implying pro- 
portionality A.(E) + A,(E) for the total transition amplitude composed of the two 
excitation paths. The sign of the field thus controls the total outcome of the 
excitations, yielding A.(|E]) + A,(|E|) for positive E and A,.(|E|) — A,(|E|) for 
negative E. Therefore, the system described by equation (5) inherently contains 
non-perturbative coherent control via quantum interference. Note that, while 
being assigned direct and indirect paths, the excitation paths are true multi-photon 
transitions in the non-perturbative regime, not perturbative single and two- 
photon transitions. 

Interference path efficiency. To determine the precise weight of direct versus 
indirect path contributions, we compare the total electron density, ne, , generated 
in the first conduction band with (F..= 1) and without (F..=0) the indirect 
paths. We define n., 300 fs after the terahertz field, that is, at a time when most 
of the carrier generation has been completed. We then construct the interference 
path efficiency 


Ne, (Fee = 1) — Ne, (Fee =0) 
Ne, (Foc = 0) 


(7) 


by computing the ratio of excess electrons created by the indirect paths 
(Ne, (Fec = 1) — ne, (Fec = 0)) and only by the direct paths (ne, (Foc = 0)). 

Extended Data Fig. 5e shows i7;pg as a function of the terahertz field strength. 
For low Eyy, Yypp remains close to zero, indicating the dominance of the direct 
paths. Increasing the external driving field strength to 22 MV cm * elevates 111g to 
50%. For Erp, = 30MV cm’, the direct and the indirect paths reach the same 
efficiency (pz = 1). In other words, non-perturbative excitations tend to balance 
the relative weights of the excitation paths, making interference effects strong, as 
described in the main text. We even observe that pz starts to decrease slightly, 
indicating a Rabi-flopping-type saturation, which is a unique hallmark of strongly 
non-perturbative excitations. 

Coherent control level. Extended Data Fig. 6 shows the computed HH intensity 
envelopes as a function of the coherent control factor F,, as presented in Fig. 2c. 
However, all time traces are now displayed on the same absolute intensity scale. 
Including all transition paths (F.. = 1) enhances the emission intensity by roughly 
30 times, that is, more than simple two-path interference (maximum enhance- 
ment by a factor of four) predicts. This is indeed expected since switching off 
indirect transition channels (F,.=0) eliminates the contribution of multiple 
polarization combinations in the emission intensity, which lowers the HHG effi- 
ciency drastically. Additionally, GaSe has a particularly strong dipole matrix-ele- 
ment between the hole bands, as explained in ref. 41. This contribution is also 
missing when F,, is set to zero, together with multiple interference pathways. By 
closing the indirect transition channels in the F,. = 0 calculation, the number of 
excited carriers is also reduced by a factor of two compared to the full calculation 
(F-< = 1). As a combination of all these effects, F.. = 0 yields a significant reduc- 
tion in HHG, besides the qualitative difference in temporal emission, compared to 
the full computation. 

Minimal requirements for non-perturbative quantum interference. In the 
main text, we have identified the following list of minimal conditions for the 
non-perturbative quantum interference: 

(1) The system must have at least three states that are all mutually dipole coupled. 
The dipole transitions can then be illustrated in a triangle diagram, as shown in 
Extended Data Fig. 10a. 

(2) The terahertz field must be non-resonant with each of the transitions because 
the excitation channels can otherwise not be balanced. 

(3) The terahertz field must be strong enough to generate non-perturbative excita- 
tions where populations change within less than one oscillation cycle of the excit- 
ing field. 

In the schematic of Extended Data Fig. 10a, circles denote three states (e, 1, and 
2) coupled with three dipole-allowed paths (arrows). The actual dipole moment 
between any two states 2 and 1’ follows from 

diy = a |e (herby (r) (8) 


Q 


TPE = 


where #,(r) is the single-particle wave function of a given state A. For solids, the 
integral can be performed over the unit-cell volume 2. 

An inversion symmetric potential, depicted schematically in Extended 
Data Fig. 10b, produces single-particle eigenfunctions that have either even 
(@(-n =¢,()) or odd (¢;,(-r) =—¢,(r)) parity. Inserting these into 


equation (8), dj, vanishes when both / and 2’ have the same parity and exists 
only between states with different parity. Since at least two out of the three states 
must have the same parity, an inversion symmetric system cannot form the 
required triangle system, making only A-, V- or ladder-transitions possible. 
Hence, the discovered non-perturbative quantum interference cannot be observed 
in inversion symmetric molecules or solids. More generally, inversion symmetric 
systems can have closed transition loops only among an even number of states, 
which always creates a total transition with odd symmetry with respect to the 
driving field. Inversion symmetric systems can therefore not exhibit the observed 
non-perturbative quantum interference even via more complex excitation paths. 
When the potential does not possess inversion symmetry as illustrated in Extended 
Data Fig. 10c, ¢,(r) has neither even nor odd symmetry. Owing to the indefinite 
parity, dj, can exist between any combination of states, as is the case in GaSe. 
Solids or molecules lacking inversion symmetry can, thus, exhibit three states 
which are dipole-connected in a triangle configuration. 

In Extended Data Fig. 10a, non-perturbative quantum interference can be 
realized in many different ways. One possibility is to choose 1— e to be the direct 
transition and 1 > 2 e the indirect path (scenario QI1). Equivalently, the direct 
transition 2— e can interfere with the indirect path 2— 1— e (scenario QI2). To 
determine the interference conditions in these two cases, we apply ¢, >@,e!” in 
equation (8) and get diy with states p, and dj, = lx), , We choose 0; to 
produce real valued dey = |de1 | and des = |dez | while dy remains complex valued. 
As derived in equation (6), the overall strength of the QI1 transition scales with 


Pe lon % [ter E(t) + dor den E(t)” | = des | E(t) +2E(t)"| (9) 


with z=|42 dy. Similar analysis for QI2 produces the expression within the 


el - 
absolute value in equation (9) with z= der di. Hence, both scenarios QI] and 


e2 
QI2 lead to constructive (destructive) interference for a positive (negative) sign of 
the driving field and thus qualitatively contribute in the same way to the overall 
outcome of HH emission. 

Ultrashort pulse shaping. The new cross-correlation scheme allows us to deter- 
mine the exact timing with respect to the driving terahertz waveform as well as the 
intensity of the emitted HH bursts. The relative weight of HH bursts emitted at 
different half-cycles of the phase-stable waveforms can be controlled via the car- 
rier-envelope phase (CEP) of the transients**!: while continuously varying the 
CEP of the driving field, we observe that the individual peaks of the HH pulse 
train shift under a fixed envelope (Extended Data Fig. 8), defined by the envelope 
of the driving waveform. Quantum interference between more than two energy 
bands causes a pronounced polarity dependence of HH emission, leading to a 
temporal spacing of the HH bursts by the full driving period T instead of T/2. This 
intrinsic property facilitates realization of isolated ultrashort field bursts even if the 
intensity envelope of the driving pulse is substantially longer than one half-cycle. 

If the CEP changes by m (Extended Data Fig. 8b) one may switch between one 
dominant HH pulse centred at the maximum field position (black curve) and 
two equally intense bursts corresponding to two positive driving half-cycles of 
equal field strengths (red curve). The direct CEP-controllability may thus allow 
for the generation of isolated ultrashort HH bursts as proposed in theoretical 
studies®*. 

The flat spectral phase within single emission bursts of HHs favours the emis- 
sion of extremely short optical pulses directly from the bulk semiconductor. Based 
ona semiclassical intraband calculation (supplementary information of ref. 5), we 
estimate that well-chosen spectral amplitude filtering could allow for the genera- 
tion of 3-fs short bursts (Extended Data Fig. 9) without external pulse compres- 
sion. Extending the subcycle pulse shaping techniques introduced in this study to 
higher frequencies by using wide-gap semiconductors as HH emitters may thus 
ultimately yield even subfemtosecond bursts from solid-state sources. 
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Extended Data Figure 1 | High-order harmonic (HH) spectrum generated — employing InGaAs and cooled Si detectors. At a frequency of 476 THz 
by intense phase-locked terahertz pulses in bulk GaSe. The ultrabroadband interband photoluminescence (PL) dominates the spectrum. The driving 
HH emission is recorded using a monochromator with a calibrated multi-terahertz waveform is shown in the inset. 

pyroelectric detector (PED), a lead sulfide diode (PbS) and spectrometers 
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Extended Data Figure 2 | Determination of the absolute timescale. 

a, Electro-optic signal of the multi-terahertz driving field as recorded using a 
BBO detector (thickness, 10 jm; blue curve, Sgos-ppo) and a (110)-oriented 
ZnTe crystal (thickness, 6.5 j1m; black dashed curve, Spos-znre) for spectral 
components between 12 and 45 THz. The black solid curve represents the 
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corresponding terahertz electric field Ep, as a function of delay time after 
correction for the complex-valued detector response of the ZnTe crystal 
displayed in b. b, Absolute value (black solid curve) and phase (red dashed 
curve) of the transfer function* calculated for a 6.5-m-thick ZnTe electro- 
optic detector in the multi-terahertz frequency range. 
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Extended Data Figure 3 | Double-blind reconstruction of the experimental _data set of main text Fig. 1. The measured spectral intensity of the sum- 
XFROG data. a, Retrieved spectral intensities of the detected HH pulse frequency signal is shown as a red shaded area for comparison. All spectra are 
sequence (grey, numerals indicate harmonic orders), the gating pulse (blue) normalized to their individual maximum and shifted in intensity for clarity. 


and the sum-frequency intensity (SF) for optimal temporal overlap of gating —_b, Temporal intensity profiles of the gating pulse as measured and 
and HH pulses (black dashed). All spectra displayed here correspond to the reconstructed via SHG-FROG (blue) and double-blind XFROG (black dashed). 
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Extended Data Figure 4 | Microscopic model. a, Tight binding band 
structure of GaSe with three valence bands (red) and two conduction bands 
(blue). The bandgap Eg is marked by the black arrow. b, Schematics of 
interband transitions. The external electric field (orange curve) stimulates a 


transition between electronic states (blue spheres) in different bands via the 
dipole moment dh, ¢,, indicated by the black arrow. c, Schematics depicting 
intraband currents. Carriers (blue spheres) are accelerated by the external 
electric field (orange curve) inside their respective bands. 
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Extended Data Figure 5 | Quantum interference paths. a, Simplified band- 
structure schematics consisting of two valence bands h, and hp (red lines) and 
one conduction band e (blue line). The allowed transitions are labelled and 
marked by black arrows. b, Schematic of the direct excitation path. The 
transition h, — e is marked by the blue arrow. ¢, Indirect excitation path 
consisting of the transition h, — h, (red arrow) and h, — e (three-dimensional 
red arrow). d, Full quantum interference (QI): combination of the direct 
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excitation path h, > e (blue arrow) and the indirect excitation path 

h, = hy e (red arrows). e, The interference path efficiency mpp(Eruz), black 
line, describes the fraction of additional carriers promoted to the conduction 
band e, by indirect paths such as hj > h, > e as a function of the driving 
terahertz field strength. For the peak fields used in our calculations (indicated 
by dashed red lines), the strength of indirect excitations is of the same order 
of magnitude as the strength of direct excitations (7pzO(1)). 
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coherent transitions between occupied valence bands. The same data are 
presented in Fig. 2c in normalized form. Bright colours mark strong emission, 


Extended Data Figure 6 | Non-perturbative quantum interference. Globally 
dark colours mark weaker emission (colour key at top right). 


normalized HH intensity envelopes computed within the five-band model 
as a function of delay time and the coherent control factor F., regulating 
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Extended Data Figure 7 | Temporal phase of the HH emission from GaSe. __ the detection bandwidth has been subtracted for better visibility: 


The black curve depicts the retrieved relative phase of typical HH bursts for dun(t) = dyy(t) — 2mvct. The intensity envelope Iy3; (blue shaded area) 
driving peak fields of 33 MV cm. A trivial linear contribution to the and the derived instantaneous frequency 1; of the three main HH pulses 
phase representing the weighted central frequency v, of detected HHs within _(red curves) are shown on the same timescale. 
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Extended Data Figure 8 | HH pulse shaping via the CEP of the driving ¢cep of the driving waveform. b, Reconstructed HH intensity envelopes for 


waveform. a, False-colour plot of the reconstructed intensity envelopes of the cp = 0 (black) and dcgp = 1 (red). 
HH pulse sequence versus delay time t for different carrier-envelope phases 
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Extended Data Figure 9 | Ultrashort pulse shaping via spectral amplitude model. Inset, calculated amplitude spectrum driven by a multi-terahertz 


filtering. Intensity envelope (blue shaded area) of spectrally filtered HH transient with external peak fields of 43 MV cm_’ as emitted from the sample 
bursts (inset, blue shaded spectrum) emitted from GaSe featuring a full-width (grey shaded) and spectrally filtered (blue shaded) with a suitable high-pass 
at half-maximum of 3 fs only, as calculated with a semiclassical intraband filter (black dashed). 
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Extended Data Figure 10 | Triangle system and symmetry. a,Schematicofa (black solid line) and the resulting wave functions with even (red shaded area) 
triangle system of electronic states as a minimal condition for non-perturbative and odd (blue shaded area) parity. c, System with an asymmetric potential 
quantum interference. Filled circles 1 and 2 denote occupied states, whereas (black line) resulting in wave functions with indefinite parity (orange shaded 
the sphere e symbolises an empty state. b, A symmetric potential areas). See Methods for details. 
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Flexible high-temperature dielectric materials 
from polymer nanocomposites 


Qi Li', Lei Chen’, Matthew R. Gadinski', Shihai Zhang”, Guangzu Zhang’, Haoyu Li’, Aman Haque’, 


Long-Qing Chen!', Tom Jackson? & Qing Wang! 


Dielectric materials, which store energy electrostatically, are 
ubiquitous in advanced electronics and electric power systems’ *. 
Compared to their ceramic counterparts, polymer dielectrics have 
higher breakdown strengths and greater reliability’ *”, are scalable, 
lightweight and can be shaped into intricate configurations, and 
are therefore an ideal choice for many power electronics, power 
conditioning, and pulsed power applications’®”°. However, poly- 
mer dielectrics are limited to relatively low working temperatures, 
and thus fail to meet the rising demand for electricity under the 
extreme conditions present in applications such as hybrid and 
electric vehicles, aerospace power electronics, and underground 
oil and gas exploration’’’. Here we describe crosslinked polymer 
nanocomposites that contain boron nitride nanosheets, the dielec- 
tric properties of which are stable over a broad temperature 
and frequency range. The nanocomposites have outstanding 
high-voltage capacitive energy storage capabilities at record tem- 
peratures (a Weibull breakdown strength of 403 megavolts per 
metre and a discharged energy density of 1.8 joules per cubic cen- 
timetre at 250 degrees Celsius). Their electrical conduction is 
several orders of magnitude lower than that of existing polymers 
and their high operating temperatures are attributed to greatly 
improved thermal conductivity, owing to the presence of the boron 
nitride nanosheets, which improve heat dissipation compared to 
pristine polymers (which are inherently susceptible to thermal 
runaway). Moreover, the polymer nanocomposites are lightweight, 
photopatternable and mechanically flexible, and have been demon- 
strated to preserve excellent dielectric and capacitive performance 
after intensive bending cycles. These findings enable broader appli- 
cations of organic materials in high-temperature electronics and 
energy storage devices. 

The best commercially available dielectric polymer represented by 
biaxially oriented polypropylene (BOPP) can operate only at tem- 
peratures below 105 °C (ref. 14). Therefore, thermal management is 
always required to enable the use of dielectric polymers in high- 
temperature applications. For example, to accommodate BOPP film 
capacitors in the power inverters of hybrid and electric vehicles, 
which are used to control and convert direct current from batteries 
into the alternating current required to power the motor, cooling 
systems have to be employed to decrease the environmental temper- 
ature from about 140°C to about 70°C. This brings extra weight, 
volume and energy consumption to the integrated power system and 
reduces its reliability and efficiency. The upsurge in lightweight and 
flexible electronic devices has also created a tremendous demand for 
high-temperature dielectric polymers, as the heat generated by elec- 
tronic devices and circuitry increases exponentially with miniatur- 
ization and functionality. 

A variety of high-performance engineering polymers have been 
considered as possible high-temperature dielectric materials to address 
these urgent needs'*"’. Until now, the key criteria established for 


evaluating high-temperature dielectric polymers has been the glass 
transition temperature (T,) and thermal stability. At temperatures 
approaching T,, polymers lose their dimensional and electromech- 
anical stability and display large variations in dielectric constant (K) 
and dissipation factor (DF) with temperature. Making use only of 
materials that have high T, works reasonably well at high temperatures 
but only under relatively low electric fields and this approach has 
had very limited success when the material is subject to both high 
temperatures and high voltages. 

Here (see Fig. 1a) we thermally crosslinked divinyltetramethyldisi- 
loxane-bis(benzocyclobutene) (BCB) in the presence of boron nitride 
nanosheets (BNNSs, Fig. 1b and c) to afford the crosslinked nanocom- 
posite c-BCB/BNNS (Fig. 1f and g, Supplementary Information sec- 
tion 1). BNNSs, which form a wide-bandgap (~6 eV) insulator with 
ultrahigh thermal conductivities in the range ~300-2,000 Wm *K™* 
(refs 20 and 21), were prepared through liquid-phase exfoliation of 
hexagonal boron nitride (h-BN) powders”. 

Compared to the crosslinked pristine BCB referred to as c-BCB, the 
most striking feature of c-BCB/BNNS is substantially suppressed high- 
field electrical conduction at high temperatures (Supplementary 
Information sections 2 and 3). For example, the electrical conductivity 
decreases from 4X10 '?Sm" in c-BCB to 9.2X 10 “Sm ° in 
c-BCB/BNNS and the conduction loss decreases from 18% in c-BCB 
to 3% in c-BCB/BNNS under an applied field of 200 MV m“' at 150°C. 
Coupled with the higher Young’s modulus arising from the introduced 
BNNSs, which impedes the occurrence of the electromechanical break- 
down”, the largely reduced electrical conduction in c-BCB/BNNS 
results in a greatly improved Weibull breakdown strength (E,) at high 
temperatures. For example, E, improves from 262 MV m' for c-BCB 
to 403 MV m | for c-BCB/BNNS with 10 vol% BNNSs at 250°C. We 
note that polymer nanocomposites were previously designed towards 
improved capacitive energy storage, including BNNS-containing nano- 
composites, and were mainly intended for room temperature applica- 
tions’. For example, at E, = 200 MV m— 1 the dielectric loss of the best 
ferroelectric polymer nanocomposite with BNNSs is 15% at room tem- 
perature, but this quickly rises to 76% when the temperature is 
increased to 70 °C (Supplementary Fig. 25). 

The dielectric properties of c-BCB/BNNS have been evaluated along 
with state-of-the-art high-temperature capacitor-grade polymer films 
including polycarbonate (PC, T, ~ 150°C), poly(ether ether ketone) 
(PEEK, T, ~ 150°C), polyetherimide (PEI, Ty = 217°C), fluorene 
polyester (FPE, T,~ 330°C) and polyimide (Kapton PI (from 
Dupont), T, ~ 360 °C) (Supplementary Table 1). We first examine K 
and DF as a function of temperature and frequency (Fig. 2, 
Supplementary Information section 4). At 10* Hz, which is the 
frequency of interest for common power conditioning, a minor vari- 
ation in K with temperature, that is, <1.7%, is seen in c-BCB/BNNS 
from room temperature to 300 °C, while FPE, the next-best dielectric 
investigated in this study, shows a K variation of over 8% at 300°C 
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Figure 1 | Material preparation and 
structures. a, Schematic of the 
preparation of c-BCB/BNNS films. 

b, c, Transmission electron 
microscopy (TEM) images of BNNSs 
exfoliated from h-BN powders. Inset 
to c is an electron-diffraction pattern 
of BNNSs, showing its hexagonal 
symmetry. d, Chemical structure of 
the BCB monomer. e, The repeating 
unit of c-BCB. f, Photographs of a 
10-pm-thick c-BCB/BNNS film 
wrapped around a glass tube with 
diameter 4 mm. g, A bent 10-um- 
thick c-BCB/BNNS film. h, The 
photopatterned c-BCB/BNNS on a Si 
wafer. i, j, Optical microscopic images 
of the patterned films; the dark 
regions correspond to c-BCB/BNNS. 


Soft baking 


relative to room temperature (Supplementary Fig. 15). As presented in 
Fig. 2c, the temperature coefficient of K for c-BCB/BNNS is around 
65 parts per million (p.p.m.) per °C, compared to 308 p.p.m. °C | and 
498 p.p.m. °C”! for FPE and Kapton, respectively, within the temper- 
ature range 25-300°C. Even under a direct-current bias voltage of 
50 MV m , the K variation of c-BCB/BNNS is still as low as 1.6% at 
250°C, compared to 8.5% for FPE (Supplementary Fig. 15). 


Concurrently, the DF value of c-BCB/BNNS at 10* Hz only 
increases from 0.09% to 0.13% with increasing temperature up to 
300 °C (Supplementary Fig. 15). Although Kapton shows a stability 
of DF with temperature under direct-current bias similar to that of 
c-BCB/BNNS, appreciable increases in DF have been observed in all 
the other polymer dielectrics. For example, the DF of FPE jumps from 
0.22% at room temperature to 1.35% at 280 °C. It is also evident that, of 


a 50 b ee ee Figure 2 | Dielectric stability. 
104 Hz 0.015 + 104 Hz a, Temperature dependence of 
4.5, == GECBIENNS dielectric constant. b, The DF of 
40 PEEK + A 0.012 L OPC c-BCB/BNNS with 10 vol% of BNNSs 
_. =—A— PEEK and high-T, polymer dielectrics. 
= rs 
3s 5 0.009 + —y— PEI c, Temperature coefficient of the 
5 3 =e Epe dielectric constant of c-BCB/BNNS 
pe = mies with 10 vol% of BNNSs and high-T, 
Bos —— Kapton polymer dielectrics at various 
2 = temperature ranges. d, Frequency 
O 20 fa 0.003 dependence of the dielectric constant 
Reisner and DF of c-BCB/BNNS with 10 vol% 
1.5 0.000 F of BNNSs at different temperatures. 
iol ty Error bars show standard deviation. 
50 100 150 200 250 300 50 100 150 200 250 300 
Temperature (°C) Temperature (°C) 
c -800 d 3.5 0.020 
MMMc-BCB/BNNS [MMFPE 10¢Hz 
- i -700} as Kapton wa PE! 3.0 =F — 8 
5 : lB PEEK al 40.015 : 
es fa 250 8 
B= of estan joot0 ¢ 
§ 2 15 —a— 200 °C e 
oe —v— 250°C 40.005 2 
ge 1.0+ a 
Oo —r 4 
& 2 ne = = SESS 1 0=5 0.000 
25-150 25-200 25-250 25-300 ere 10° 10° 10° 10° 
Temperature range (°C) Frequency (Hz) 


30 JULY 2015 | VOL 523 | NATURE | 577 


©2015 Macmillan Publishers Limited. All rights reserved 


LETTER 


aos b Figure 3 | Electrical energy storage capability. 
a —=—c-BCB/BNNS 150 °C a [ = —s—s—5__, 150 °C Discharged energy density and charge-discharge 
5 ppl. =o=e = | & 90- mis co ‘ o—, efficiency of c-BCB/BNNS with 10 vol% of BNNSs 
d = A 2 : 
= wae ra 3 80 oe a. ia and hil, polymer dielectrics measured at 
2 2 © ah SS 150°C (a, b), 200°C (c, d) and 250°C (e, f). 
Q -v— (S) fo} 
245 ‘7— Kapton ge key © 7 
(3) 
zo) == ® 60- " 
> the a ® ¢9[-@—c-BCB/BNNS \ \, 
B | gL ~<IH PEEK ye z 
o 1. = 4ghmOm Pc A 
5 fe) ‘A 9 
3 La ° 39 [-O— FPE M4 \ 
0.5 —v— Kapton 
8 ~ mee 4 4 a 
5 & 140 [—O= PEl 
2 00 Oy LEt= PEEK 
50 100 150 200 250 300 350 400 50 100 150 200 250 300 350 400 
Electric field (MV m1) Electric field (MV m~) 
© 25 
| 200 °C ee oe 200 °C 
—"— c-BCB/BNNS oot $=—8——#—~a__, 
2.0; —A—FPE 2 ia, 
fr ial ©. *—,. 
—v— Kapton J 70l oo 
1.5 i 
= PEI x sol : 


° 


Discharged energy density (J cm-°) 
Charge-discharge efficiency (%) 


ie ea 


*, 


Electric field (MV m1) 


0.5} Gg 30- aj FPE 4a 
L DA 20k 
A =—v— Kapton \ 
0.0} °= 10- 
—— PEI “ 
1 1 1 1 1 1 1 1 0 i a: 1 1 i 1 1 L 
50 100 150 200 250 300 350 400 50 100 150 200 250 300 350 400 


Electric field (MV m) 


V 


—@8— c-BCB/BNNS 
=—A— FPE 
=v— Kapton 


€ = —@8— c-BCB/BNNS enn oy 
315 747 FPE ra > 

2 —v— Kapton os & 

5 f € 

so 1.0 t) ra) 

> ra g 

o ue s 

7 [3] 

= 0.5 a 2 45 
® a Ay A 
2 ZX & 20 
oO —— Ss 

5 0.0 iy = 10 
2 (6) 

a f . i i : f 0 


n 1 . 
50 100 150 200 250 300 350 400 50 


Electric field (MV m7") 


the dielectrics assessed, c-BCB/BNNS offers the most stable K and DF 
in the frequency range 10°-10° Hz at high temperatures (Fig. 2d, 
Supplementary Figs 16 and 17). 

We next studied high-field capacitive energy storage properties at 
high temperatures (Supplementary Information section 5). As sum- 
marized in Fig. 3, c-BCB/BNNS clearly outperforms all the high-T, 
polymer dielectrics at temperatures ranging from 150°C to 250°C in 
terms of the discharged energy density (U.) and the charge—discharge 
efficiency (1). For example, c-BCB/BNNS can discharge a U, exceeding 
2.2Jcm * under 400 MV m“' with a7 of larger than 90% at 150 °C. At 
200 °C, c-BCB/BNNS delivers a U. of 2 J cm” * under 400 MV m_’, 
which is twice that of PEI, accompanied by a 77 value more than 1.5 
times higher than that of PEI. As the temperature is further raised to 
250 °C, where none of the high-T, polymer dielectrics can operate at 
more than 150 MV m_', c-BCB/BNNS is functional up to 400 MV m~! 
with a U, of ~1.8Jcm *. Remarkably, at 200 MV m_', which is the 
operating condition of BOPP film capacitors in electric vehicles”®, the 
value of 7 for c-BCB/BNNS at 150 °C—that is, ~97%—is the same as 
that of BOPP at 70 °C (Supplementary Fig. 24). This indicates that, by 
replacing BOPP with c-BCB/BNNS, the complex cooling system for 
power inverters in electric vehicles could be eliminated. Furthermore, 
under these conditions, the U. of c-BCB/BNNS is over 40% higher than 
that of BOPP owing to its higher K; that is, 3.1, versus 2.2 for BOPP. 

The superior performance of c-BCB/BNNS over the high-T, poly- 
mer dielectrics stems from its substantially reduced high-field leakage 
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current at elevated temperatures. For example, at 200 °C and a field of 
200 MV m1}, a current density of 4.2 X 10 ®Acm ” is found in 
c-BCB/BNNS, which is nearly one order of magnitude lower than that 
of PEI and two orders of magnitude smaller than those of FPE and 
Kapton (Supplementary Fig. 28). It is important to note that the elec- 
trical conduction not only accounts for dielectric loss, which degrades 
U, and y, but also generates Joule heating within dielectrics”. 
Depending on the heat dissipation (which is determined primarily 
by the thermal conductivity of the dielectrics), the geometry of the 
capacitors, and the cooling systems, the steady-state internal temper- 
ature of capacitors could exceed the T, or even the decomposition 
temperature of dielectric polymers and cause capacitor failure. 

We simulate the steady-state internal temperature distribution of the 
dielectric films by using finite element computations*® (Methods, 
Supplementary Information section 6). As summarized in Supple- 
mentary Tables 4-6, it can be seen that the steady-state internal tem- 
peratures of c-BCB/BNNS film are consistently much lower than those 
of the high-T, polymers operating under the same conditions, owing to 
a pronounced reduction in conduction loss and a marked enhancement 
in thermal conductivity, that is, from ~0.2 Wm ~ 1k"! for the polymers 
to 1.83Wm / K |! for c-BCB/BNNS. As exemplified in Fig. 4, under a 
forced convection with air (convective heat transfer coefficient h = 35) 
and an ambient temperature of 200 °C, the PEI-, FPE- and Kapton- 
based film capacitors are overheated, with temperatures at the film 
centres of 219 °C, 361 °C, and 435°C, respectively, at 200 MV m. 
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Figure 4 | Steady-state temperature distribution. Simulated steady-state 
temperature distribution in spiral-wound film capacitors of 40 X 40 X 10 mm? 
enclosure size, based on c-BCB/BNNS with 10 vol% of BNNSs, PEI, FPE and PI 
(Kapton), respectively, given continuous operation under an electric field of 
200 MV m ', with an ambient temperature of 200 °C and a convective heat 
transfer coefficient of 35. As the model is symmetric in three Cartesian 
coordinates, only 1/8 of the volume is shown, to expose the centre part of 
the capacitor. 


In contrast, the highest temperature inside c-BCB/BNNS is only 
204 °C. With the field increasing to 300 MV m ! and 400MVm ', 
the temperatures at the film centre of c-BCB/BNNS are 213°C and 
255 °C, respectively, still below its T, (>350°C), when h = 200. 
Notably, even at 250 °C and with active liquid cooling, c-BCB/BNNS 
is operable at 300 MV m_' with a maximum internal temperature of 
308 °C. Under high-voltage cycling conditions, it is in fact thermal 
runaway” that dictates the maximum operation field of dielectric 
polymers and thus acts as the limiting factor of high-field capacitive 
energy storage at elevated temperatures. Despite high T, and excellent 
thermal stability from engineering polymers, their well documented 
poor thermal conductivities*® seriously limit the actual working tem- 
peratures of capacitors under high fields. 

Finally, we demonstrate that c-BCB/BNNS can be readily prepared 
by photo-polymerization of solution-cast films, which, upon further 
curing, are found to possess essentially the same dielectric properties as 
the thermally crosslinked films (see Methods and Supplementary 
Fig. 27). The ultraviolet-induced crosslinking through an optical mask 
enables direct photopatterning of c-BCB/BNNS films (Figs 1h-j); this 
is highly desirable in device fabrications. Furthermore, no degradation 
in dielectric stability, U. and 4 of c-BCB/BNNS measured at room 
temperature and 250°C was observed after rigorous winding and 
bending tests (Supplementary Information section 7 and Supple- 
mentary Video). This suggests that this nanocomposite may be used 
in practical flexible electronics and high-throughput roll-to-roll pro- 
cessing into wound cells. Also, it is noteworthy that c-BCB/BNNS has 
the lowest mass density (~1.10gcm *) of the polymer dielectrics 
studied (Supplementary Table 1). This desirable combination of pro- 
cessibility, flexibility, light weight, and dielectric and capacitive per- 
formance in such nanocomposites may transform the way compact 
power modules and power circuits targeted for harsh environment 
applications are built. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


Materials. Dipropylene glycol dimethyl ether (DMM), BCB monomers and 
b-staged BCBs (partially polymerized) with the number average molecular weight 
of ~25,000 were provided by Dow Chemical. Boron nitride powders were pur- 
chased from Sigma-Aldrich. The high-T, polymer dielectric films were provided 
by PolyK Technologies. The polyimide (Kapton) films were vacuum dried over- 
night at 70 °C before use. All the other materials were used as received. 
Preparation. BNNSs were prepared from boron nitride powders using a solution 
phase exfoliation method”*. To make the nanocomposites, BNNSs were first dis- 
persed in DMM ata concentration of 5 mg ml‘. 100 mg of BCB monomers were 
dissolved in 2 ml of DMM and stirred for 2 h. Afterwards the DMM solution of 
BNNSs was mixed with BCB solution in proportion, and the mixture was first 
stirred for 10 min and then sonicated for 5 min using a tip-type sonicator (175 W). 
To crosslink the material, the mixture was drop-cast on a glass slide and subject to 
baking at 120 °C for 30 min, which was followed by curing at 250 °C for 2 h under 
Np. The film was peeled off after soaking in water for 5 min. The film thickness can 
be varied by tuning the concentration of the cast solution. The thickness of films 
used for electrical characterizations is within the range of 6-12 um. For ultraviolet- 
induced crosslinking of the material, DMM solution of b-staged BCB mixed with 
BNNSs was used to cast a film on Si wafer by spin-coating. Then the spin-coated 
film was subject to a 20-min ultraviolet exposure in an ultraviolet crosslinker (XL- 
1500, Spectroline) equipped with ultraviolet tubes (BLE-1T155, Spectroline). The 
thickness of the film can be varied by changing the solution concentration and 
parameters of spin-coating. Typically, a 30 wt% solution spin-coated at 700 r.p.m. 
yields a 4-jum-thick film, which, after curing at 250°C for 15min under N,, 
achieves the same level of performance as that of the thermally crosslinked films. 
For a typical procedure of photopatterning, the material was spin-coated on Si 
wafer from a 20 wt% solution of b-staged BCB mixed with BNNSs at 3,000 r.p.m., 
and then covered with an optical mask before ultraviolet irradiation. Afterwards, 
the material was developed using DMM and dried. 

Characterization. Fourier-transform infrared (FTIR) spectra were obtained in the 
attenuated total reflectance (ATR) mode using a ZnSe crystal as a contact to the 
samples with a Varian Digilab FTS-8010 spectrometer. Differential scanning 
calorimetry was conducted by using a TA Instrument Q100 differential scanning 
calorimeter at a heating/cooling rate of 10 °C min’. X-ray diffraction analysis was 
studied using a PANalytical X’pert Pro MPD theta-theta diffractometer. 
Thermogravimetric analysis was performed with a TGA 2050 Analyzer at a heat- 
ing rate of 10°C min” '. TEM images were obtained on a JEOL JEM-2001F trans- 
mission electron microscope. Scanning electron microscopy measurements were 
performed with a FEI Nova NanoSEM 630 field emission electron microscope. 
Gold electrodes of diameter 6 mm and thickness 60 nm were sputtered on both 
sides of the polymer films for the electrical measurements. Dielectric spectra were 
acquired over a broad temperature range using a Hewlett Packard 4284A LCR 
meter in conjunction with a Delta Design oven model 2300. Dielectric spectra 
under direct current bias were collected with the same equipment along with a 
Hewlett Packard 4140B pA meter/voltage source, a KEPCO BOP 1000M amplifier 
and a protective circuit. Conduction currents were obtained under an electric field 
provided by a Hewlett Packard 4140B pA meter/voltage source and TREK model 
2210 amplifier. High-field electric displacement-electric field loops were collected 
using a modified Sawyer-Tower circuit, where the samples were subject to a 
triangular unipolar wave with a frequency of 10 Hz. Dielectric breakdown strength 
measurements were performed on a TREK P0621P instrument using the electro- 
static pull-down method under a direct-current voltage ramp of 500Vs"1. 
Young’s moduli were derived from strain-stress curves measured with a TA 


RSA-G2 Solids Analyzer, using a constant linear stretching rate of 0.02%s"’. 


See Supplementary Information section 6 for the method of thermal conductivity 
measurement. 

Two-parameter Weibull statistic. Dielectric breakdown strength is analysed 
within the framework of a two-parameter Weibull statistic described as: 


P(E) =1— exp(—(E/a)*) 


where P(E) is the cumulative probability of electric failure, E is the measured 
breakdown field, the scale parameter « is the field strength for which there is a 
63% probability for the sample to breakdown (Weibull breakdown strength), the 
shape parameter f evaluates the scatter of data and a higher value of / represents 
greater dielectric reliability. 

Temperature coefficient of dielectric constant. The temperature coefficient of 
dielectric constant, t,,, for a given temperature range (from T; to T;), is defined as: 


Te, = (Kr —Kj)/[Kret (T? _ T;) 


where K, cis the dielectric constant at room temperature, T; and T;are the low-end 
and high-end temperatures, respectively, and K; and Krare the dielectric constants 
at T; and T;, respectively. 

Simulation of steady-state temperature distribution. The full-field temperature 
evolution in the metallized thin film capacitor during joule heating is mathemat- 
ically governed by 


OT (x) 
ot 
where p,, and C are the density and heat capacity respectively. K is the thermal 
conductivity and E stands for the applied electrical field. In particular, as shown by 


the experimental measurements, the electrical conductivity o depends on the 
temperature T, having the characteristic form of: 


= exp(- ol (2) 


where the coefficients o9 and A for each material at one specific applied electric 
field are calculated by fitting the measured data and kg is the Boltzmann constant. 
Setting OT(x)/0t = 0 yields the steady-state temperature solution of equation (1): 


KV°T(x) +0(x,T)E’ =0 (3) 


PmC =KV°T(x)+0(x,T)E (1) 


The governing equation is then solved by finite element simulations using the 
commercial software Comsol 5.0 (http://www.comsol.com/release/5.0). When the 
capacitor is operating the thin slices of the structures are packed into an enclosure, 
with the cooling liquid surrounding the enclosure. Therefore, the capacitor is 
treated as an integrated body with a uniform applied electric field for each cal- 
culation. The geometries arising from the experimental setup are used in the finite 
element model. As a result of the laminar structure of the capacitor, in which the 
polymer has a relatively low thermal conductivity compared to the metallization, 
the anisotropy in thermal the conductivity is considered in the model (as shown in 
Supplementary Information section 6). During the finite element simulation, the 
heat flux at all the enclosure surfaces of the capacitor is produced by both con- 
vective and radiative heat transfer arising from the surrounding cooling, with the 
form: 


n(—KVT) =h(T — Text) + ospe(T* — Th) (4) 


where T,x; is the surrounding temperature, h is convective heat transfer coefficient, 
dsp is the Stefan—Boltzmann constant, and ¢ is the emissivity of the surfaces. 
Owing to symmetry only 1/8 of the volume is modelled. Based on the mesh- 
sensitive study, the number of mesh elements in the system is set as 
120 X 120 X 30 with a grid spacing of 0.33 mm. 
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Onset of Antarctic Circumpolar Current 
30 million years ago as Tasmanian Gateway 


aligned with westerlies 


Howie D. Scher’, Joanne M. Whittaker’, Simon E. Williams’, Jennifer C. Latimer’, Wendy E. C. Kordesch? & Margaret L. Delaney® 


Earth’s mightiest ocean current, the Antarctic Circumpolar 
Current (ACC), regulates the exchange of heat and carbon between 
the ocean and the atmosphere’, and influences vertical ocean struc- 
ture, deep-water production’ and the global distribution of nutri- 
ents and chemical tracers’. The eastward-flowing ACC occupies a 
unique circumglobal pathway in the Southern Ocean that was 
enabled by the tectonic opening of key oceanic gateways during 
the break-up of Gondwana (for example, by the opening of the 
Tasmanian Gateway, which connects the Indian and Pacific 
oceans). Although the ACC is a key component of Earth’s present 
and past climate system’, the timing of the appearance of dia- 
gnostic features of the ACC (for example, low zonal gradients 
in water-mass tracer fields*”) is poorly known and represents 
a fundamental gap in our understanding of Earth history. Here 
we show, using geophysically determined positions of continent- 
ocean boundaries*, that the deep Tasmanian Gateway opened 
33.5+1.5 million years ago (the errors indicate uncertainty in 
the boundary positions). Following this opening, sediments from 
Indian and Pacific cores recorded Pacific-type neodymium isotope 
ratios, revealing deep westward flow equivalent to the present-day 
Antarctic Slope Current. We observe onset of the ACC at around 
30 million years ago, when Southern Ocean neodymium isotopes 
record a permanent shift to modern Indian-Atlantic ratios. Our 
reconstructions of ocean circulation show that massive reorgan- 
ization and homogenization of Southern Ocean water masses coin- 
cided with migration of the northern margin of the Tasmanian 
Gateway into the mid-latitude westerly wind band, which we recon- 
struct at 64° S, near to the northern margin. Onset of the ACC 
about 30 million years ago coincided with major changes in global 
ocean circulation’ and probably contributed to the lower atmo- 
spheric carbon dioxide levels that appear after this time’. 

The development of a deep oceanic passage in the Tasmanian 
Gateway between Australia and Antarctica has been linked to the onset 
of the ACC"’. However, the relationship between the tectonic evolu- 
tion of the Southern Ocean and the appearance of diagnostic features 
of the present-day ACC is poorly known. The formation of the 
Tasmanian Gateway occurred as a result of the continental break-up 
of Gondwanaland during the late Cretaceous and Palaeogene peri- 
ods’. Australia and Antarctica separated about 83 million years ago 
(Ma), although seafloor around the Tasmanian Gateway subsided 
slowly’, and a seawater connection was delayed until after 49 Ma 
(ref. 14). The first ocean current in the Tasmanian Gateway was a 
shallow water current travelling from the Pacific into the Indian 
Ocean (that is, westward) under the influence of the polar easterly 
winds, as the gateway occupied a more southerly position in the middle 
Eocene™. Today, the eastward-flowing ACC is driven by the mid- 
latitude westerly winds, which produce a southerly pressure gradient 


force that balances wind-driven northward Ekman transport of surface 
water, resulting in a geostrophic current that extends to the seafloor’>. 
These findings indicate that onset of the ACC must have been accom- 
panied by a reversal in the water-mass tracer field through the 
Tasmanian Gateway. Moreover, the results imply that the position 
of the gateway relative to the wind field was essential for ACC onset'’. 
We test the viability of this hypothesis by reconstructing absolute 
palaeolatitudes of the gateway through time, the position of the polar 
front (that is, the boundary between the polar easterly and mid-latitude 
westerly winds) during the Oligocene epoch, and the evolution of the 
zonal gradient of neodymium (Nd) isotopes in the Southern Ocean. 

We reconstruct the tectonic opening of the Tasmanian Gateway” in 
a moving hotspot reference frame’’, using geophysically determined 
continent-ocean boundaries (COBs) from the South Tasman Rise and 
Antarctica®. Opening of the Tasmanian Gateway below around 500 m 
(that is, upper continental slope) is indicated by separation of the 
COBs (Fig. 1). Defining the innermost and outermost possible 
COBs—to account for uncertainty in the COB location—produces a 
range of ages for the final separation of Australia and Antarctica of 35- 
32 Ma (Fig. 2 and Extended Data Fig. 1). During this interval, the 
northern boundary of the nascent gateway was positioned to the south 
of 65° S. 

To determine whether the gateway was in the latitude band of the 
polar easterlies at this time, we reconstruct the palaeowind field by 
taking the palaeopolar front to be the boundary between the polar 
easterly and westerly winds. The present boundary between the mid- 
latitude westerlies and the polar easterlies coincides with a zone of 
deep-water upwelling just south of the polar front (Fig. 1a). We recon- 
struct the latitude of the Oligocene polar front by using microfossil 
assemblages from sediment cores”, restoring the 30-Ma positions of 
the cores used to demarcate the front (yellow dots in Fig. 1b; see also 
Extended Data Table 1). Combining these approaches, our reconstruc- 
tions confirm that the Tasmanian Gateway was positioned within the 
polar easterlies during the final separation of Australia from Antarctica 
(Fig. 2c). The absolute reference frame used does not alter the relative 
relationship between the northern edge of the gateway and the 
Oligocene polar front; however, it does have an influence on the abso- 
lute latitude of these features (Extended Data Fig. 2). 

To assess the response of ocean circulation to the changes in palaeo- 
geography indicated by the reconstruction of tectonic plates, we 
analysed the ratios of Nd isotopes found in fossil fish teeth from 
Ocean Drilling Program (ODP) sites 1124 (Hikurangi Plateau), 1168 
(Western Tasmanian Margin) and 1172 (East Tasman Plateau); see 
Extended Data Table 2 for details of the study sites. Fish teeth incorp- 
orate seawater Nd during a post-mortem mineralogical transforma- 
tion on the seafloor; the inherited Nd signal (yg, where éxq represents 
the *Nd/!“4Nd ratio of a sample relative to that of the bulk Earth, in 
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Figure 1 | Maps of the present-day Southern Ocean and relevant study sites. 
a, Map showing study sites, present-day zonal éyq distribution, and major 
frontal zones. Black squares with red borders show the present-day locations 
of the sediment cores used in this study. Black lines show the meridional 
extent of the major frontal zones associated with the ACC”. Circles show the 
locations of Southern Ocean ferromanganese (Fe-Mn) nodules found on 
the seafloor’. The colour of the circles shows the éyxq values of the surface 
layers; these surface layers are in equilibrium with overlying bottom water. 
The size of the circles reflects water depth. SAF, Sub-Antarctic Front; PF, 
Polar Front; SACCF, Southern Antarctic Circumpolar Current Front. 

b, Reconstruction of the early Oligocene (30 Ma) tectonic plates around the 
Tasmanian Gateway (the narrow gap between Antarctica and Australia). 
Continents with present-day shorelines are in black. Light grey indicates the 
continental shelf; dark grey denotes ocean basin/oceanic crust. White 

bands along the outer continental shelf illustrate the range between the 
outermost and innermost geophysical expression of the COB on the South 
Tasman Rise (light grey and white lobe south of Tasmania) and Antarctic 
conjugate margins’. Yellow circles indicate the reconstructed position of Deep 
Sea Drilling Program (DSDP) sediment cores, with microfossil assemblage 
data’’ used to reconstruct the Oligocene position of the polar front (yellow 
band). Red squares indicate the reconstructed positions of Ocean Drilling 
Program (ODP) sediment cores, used to obtain fossil fish tooth exq records 
in this study. The plate reconstruction was made using GPlates (http:// 
www.gplates.org). 
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parts per 10,000), which reveals bottom-water Nd ratios at the time of 
deposition, is not altered when the teeth become buried in pelagic 
sediments”. 

Seawater Nd comes mainly from continental crust and has a short 
residence time in seawater (tq = 300-1,000 years). Thus, water-mass 
éna is sensitive to water-mass mixing and changes in weathering 
inputs”’. Pacific water masses are significantly more radiogenic (that 
is, have more positive éyq values) than are Indian waters (Fig. 2), 
reflecting the predominance of juvenile mafic crusts that supply Nd 
to Pacific seawater. The large and continuous interbasin éxq gradient 
between the Pacific and Indian Oceans provides a useful means to 
monitor the communication of Pacific and Indian water masses 
through the Tasmanian Gateway (Fig. 2). 

The Nd found in fossil fish teeth in our study sites comes from 
Pacific and Indian deep waters and local terrigenous inputs. Sources 
of Nd isotope data pertaining to seawater and terrigenous inputs are 
given in Extended Data Table 3. Around the time of the opening of the 
Tasmanian Gateway, Pacific deep waters had éngq values of about — 4, 
and values in the Indian Ocean were between —6 and —8 (Fig. 2 and 
Extended Data Fig. 3). Today, the nearest source of terrigenous inputs 
to the Tasman margins is the Murray River, where dissolved and 
suspended material has gyq values of —6 and —4.7, respectively. 
Terrigenous inputs will be more influential in the shallow mixed layer 
(0-400 m in this region”), which was unlikely to be deeper than pre- 
sent during the study interval”. 

Nd isotopes are sensitive to water-mass mixing, so it is essential to 
know the water depth at the time of fish tooth deposition. We use 
subsidence curves for the Tasman sediment cores’ (see Methods) to 
provide constraints on water depth (Fig. 2a). The Western Tasmanian 
Margin (site 1168) was in shallow water (that is, less than 400 m) until 
32 Ma. The énq values at about 36 Ma are within the range of source 
rocks in southwest Australia (here represented by the Murray River; 
Fig. 2), indicating that the enq signature of the surface mixed layer was 
under the influence of nearby terrigenous inputs, consistent with mod- 
ern observations of Southern Ocean surface waters**. From 35 Ma 
to 32 Ma, énq values point to an enhanced advection of the warm, 
near-surface proto Leeuwin Current that accompanied early gateway 
opening”’. The pathway and éygq signature of this current is not well 
constrained, but, given its origin, an Indian Ocean signature is likely 
and its influence has been documented near the Australian coast”’. The 
East Tasman Plateau (site 1172) was within deep water (more than 
2,000 m; see Methods and Extended Data Fig. 4) over the study inter- 
val. Fossil fish tooth éyq values plot within the Pacific endmember field 
from around 36 to 30 Ma (where ‘endmember here refers to a chem- 
ically distinct water mass that is involved in water-mass mixing). 
Throughout this period, the Hikurangi Plateau (site 1124) recorded 
éna Values in the range of the Pacific endmember (grey line Fig. 2), 
consistent with its location and depth (Hikurangi Plateau basement 
lavas erupted in the early to late Cretaceous” and did not experience 
significant thermal or tectonic subsidence during the study interval). 

Once the Western Tasmanian Margin subsided into deeper waters 
(about 32 Ma; arrow in Fig. 2a), there ceased to be a significant éxq 
gradient between the study sites, indicating that the western Pacific 
and Indian sectors of the Southern Ocean (below the mixed layer) were 
influenced by a common water mass. Between 32 and 30 Ma the source 
of Nd was Pacific seawater, indicating that the first deep-water current 
to penetrate the Tasmanian Gateway was flowing westward from the 
Pacific to the Indian Ocean. This current regime affected slope depths 
on both sides of Tasmania, and was probably equivalent to the present- 
day Antarctic Slope Current (ASC). The ASC is found at slope depths 
between 66° S and 64° S in the western Pacific and Indian sectors of the 
Southern Ocean’®, is the deep-water counterpart to the Antarctic 
Counter Current, and is driven by the polar easterlies. The Pacific 
water signature is not observed farther to the west at Kerguelen 
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Figure 2 | Subsidence curves, Nd isotope records, and conjugate margin 
palaeogeography. a, Palaeodepth curves for seafloor on the Western 
Tasmanian Margin’’ (WTM; site 1168; blue) and East Tasman Plateau (ETP; 
site 1172; grey). The blue arrow indicates when site 1168 subsided below the 
mixed layer (black line at 400 m). b, Fossil fish tooth Nd isotope records 
from the WTM, ETP and Hikurangi Plateau (HP; site 1124; red). Error 
envelopes represent instrumental uncertainty (2c) based on replicate Nd 
isotope analyses of the JNdi-1 Nd isotope standard. Error bars (standard error) 
are shown for samples where the standard error exceeds 2¢ instrumental 
uncertainty. The Pacific and Indian—Atlantic éxq envelopes are based on fossil 
fish tooth Nd isotope records from the equatorial Pacific, southern Kerguelen 
Plateau, Ninetyeast ridge, and Maud Rise (Extended Data Fig. 3). The range 
of Murray River dissolved and particulate ¢yq is shown on the right axis. c, The 
palaeolatitudes of the conjugate margins of the Tasmanian Gateway outline 
the width and position of the gateway during opening. Light blue bands 
correspond to maximum gateway width, based on geophysically determined 
positions of the innermost COBs on the South Tasman Rise and Antarctica®, 
selected along the narrowest meridional transect of the gateway (Extended Data 
Fig. 1). Dark blue band shows minimum gateway width based on the outermost 
COBs. Error bars (standard error) on the boundaries of the gateway reflect 
uncertainty in the positions of the conjugate margins based on the 95% 
confidence intervals from the rotations’’. The Oligocene position of the polar 
front (yellow band) was derived from microfossil assemblages in deep-sea 
sediment cores’’, which were incorporated into this plate reconstruction. The 
grey band centred at 30 Ma highlights when the South Tasman Rise migrated 
north of the Oligocene polar front. 


Plateau or Maud Rise in the early Oligocene (Extended Data Fig. 3), 
illustrating that different sectors of the Southern Ocean had independ- 
ent deep-circulation regimes before the onset of the ACC—possibly a 
deep counterpart to the polar gyres suggested by microfossil distribu- 
tions and numerical simulations of surface currents'!”’. 

Between 30 Ma and 29 Ma, the éngq records at the study sites con- 
verge with the Indian—Atlantic endmember. This observation indi- 
cates that the Pacific sector of the Southern Ocean, at least as far 
west as the Hikurangi Plateau, came under the influence of a current 
that flowed eastward from the Indian to the Pacific. The only other éng 
record of equivalent age in the southwest Pacific (Fe-Mn crust Nova, a 
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hydrogenous deposit dredged from the Nova Canton Trough”®) shows 
a similar shift towards Indian-Atlantic seawater éyq values. Not only 
did the direction of deep-water flow reverse at 30 Ma, but the composi- 
tion of bulk sediment (that is, lithogenous Al/Ti ratios; Extended Data 
Fig. 5) on the East Tasman Plateau changed, from reflecting mafic 
sources (typical of Pacific sediments) to suggesting a source akin to 
average upper continental crust (typical of Indian—Atlantic sediments). 
The collapse of zonal gradients throughout the Southern Ocean is 
strong evidence for the onset of the ACC between 30 and 29 Ma. 

The collapse of zonal eng gradients in the Southern Ocean coincides 
with migration of the northern margin of the Tasmanian Gateway into 
the zone of the westerlies (that is, north of the Oligocene polar front; 
Fig. 2c). The major reorganization of oceanic circulation between 30 
and 29 Ma does not coincide with any significant change in subsidence 
around the gateway or with the initial opening of a deep gateway. 
Instead, we propose that northward migration of the gateway into 
the influence of the westerly winds established the conditions for geo- 
strophic balance in the Southern Ocean that drive the modern ACC. 
Although our findings do not preclude the influence of far-field tec- 
tonics (specifically, the influence of the Drake Passage) on the observed 
nq patterns, it is clear that the local interaction between the position of 
the continents and the position of the winds had a strong effect on 
patterns of Southern Ocean circulation. 

General circulation models predict a quantitative link between 
upwelling in the ACC and the strength of the Atlantic meridional over- 
turning circulation (AMOC)*”. The results of this study confirm this 
long-held view; collapse of zonal water-mass tracer gradients in the 
Southern Ocean at 30 Ma coincides with stronger AMOC and a shift 
towards the modern four-layer ocean structure after 30 Ma (ref. 9). 
Because the AMOC sequesters carbon into the deep sea as deep water 
forms in the North Atlantic Ocean, evolution of the modern ocean 
structure is consistent with an increase in the carbon storage capacity 
of the deep sea*. Thus, the onset of the ACC may help to explain the 
observed reduction in atmospheric CO, after 30 Ma (ref. 10), and ulti- 
mately the stabilization of the present-day icehouse climate state. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


Methods of treatment of deep-sea sediments. Fossil fish teeth were hand-picked 
from the >150-um size fraction of washed samples. Fossil fish teeth samples 
consisted of two to five teeth each, ranging in mass from 10g to 100jp1g Nd. 
Sample replicates were created in several intervals where fish teeth were abundant. 
Samples were sonified in quartz-distilled water and methanol to remove debris 
from surfaces and cavities, then treated with reductive, oxidative, and weak acid 
steps*» *, Samples were transferred into pre-cleaned microcentrifuge tubes before 
and after the partial dissolution step. Following final transfer into clean tubes, 50 ul 
of distilled 0.25 M HCl was added to the microcentrifuge tubes to dissolve the 
samples before column chemistry. 

Nd isotope analyses. All of the samples used in this study were processed through 
the single column method”, with the only change being a doubling of the column 
length to improve separation of samarium. Cation exchange chemistry and Nd 
isotopic analysis of samples from ODP site 1172 were done in the W. M. Keck 
Radiogenic Isotope Facility at UC Santa Cruz. Cation exchange and isotopic 
analysis of samples from ODP sites 1168 and 1124 was done in the Center for 
Elemental Mass Spectrometry at the University of South Carolina. 

All measurements were made on a Neptune multiple collector inductively 
coupled plasma mass spectrometer (MC-ICP-MS) with an Apex HF or Apex Q 
as the introduction system. An X skimmer cone was used. All Nd isotope mea- 
surements were made in static mode, and each run consisted of 100 cycles with 4-s 
integration times, or 50 cycles at 8 s. Masses 142-150 were collected in cups L1 
through H4, with mass 146 in the centre cup. Prior to each analysis, all masses were 
measured for ten 8-s cycles for blank subtractions. Blank corrections were neg- 
ligible owing to effective washout of the previous sample. An Nd isotopic standard 
(for example, Ames Nd, JNdi-1) was run after every fourth sample as a consistency 
standard and to monitor accuracy and precision. All Nd isotopes were measured 
while monitoring masses 147 and 149 (Sm) allowing for interference corrections 
on 144, 148 and 150 (Nd). These corrections are negligible because of very small 
7S$m and ‘Sm intensities (0.01% of signal). Instrumental mass discrimination 
was corrected relative to '“°Nd/’“4Nd = 0.7219 using an exponential law. All data 
have been normalized to La Jolla = 0.511858 or JNdi-1 = 0.512115; the average 
7$m/'“4Nd ratio of samples from each site was used to make the age-dependent 
correction for the in-growth of radiogenic '*°Nd since the time of deposition (see 
Source Data for Fig. 2). 

REE+Y analyses. Sub-samples containing single fossil fish teeth were separated 
from cleaned samples for rare earth element and yttirium (REE+ Y) concentration 
measurements. Samples for concentration determinations were weighed on a 
microbalance and dissolved in a solution containing 2% HNO; spiked with 
2 p.p.b. (parts per billion) indium (In). Samples were measured on an Element2 
ICP-MS at the University of South Carolina. All samples were measured in high- 
resolution mode to avoid oxide interferences from polyatomic species containing 
barium and REEs. Concentrations of REE+Y were obtained by external cal- 
ibration using a mixed-element solution and a calibrated phosphate reference 
material, fossil bone composite™*. The blank was incorporated into the calibration 
curve by measuring the 2% HNO; solution in which the samples were dissolved as 
a calibration blank. 

Oligocene polar front. The past position of the polar front, which separates 
cold polar waters from warmer waters, is based on microfossil assemblage and 
lithological data from DSDP cores in the New Zealand sector of the Southern 
Ocean”. This approach is based on core top assemblages showing a clear 
relationship between calcareous and siliceous microfossils, relative to the posi- 
tion of present-day frontal zones. In this approach, the reconstructed polar 
front for a given time slice is drawn to the south of where calcareous micro- 
fossils are abundant (indicating that surface waters were relatively warm), and 
to the north of where siliceous microfossils are abundant (indicating that sur- 
face waters were relatively cold). Nelson and Cooke” use data from Deep Sea 
Drilling Program (DSDP) sites 267 & 268, 269, 280 & 281, 274 & 278 to 
determine the Oligocene (30-Ma) polar front. This position of the Oligocene 
polar front (Fig. 1) was incorporated into our tectonic reconstruction by back- 
tracking these sediment cores to their 30-Ma locations and redrawing the front. 
The specific information used to restore the Oligocene polar front is tabulated 
in Extended Data Table 1. 

Endmember water-mass ¢yq compilations and local sediment sources. The 
Palaeogene distribution of seawater Nd isotopes in the Pacific and the Atlantic— 
Indian sectors of the Southern Ocean has been reconstructed at medium to high 
resolution using fossil fish teeth from deep-sea sediment cores. The énq value of 
Pacific seawater has not demonstrated significant variability during the time 
interval investigated in this study**’*. Owing to the low resolution of the ferro- 
manganese (Fe-Mn) Nd isotope records, we have chosen to use a newly developed 
fossil fish tooth Nd isotope stack from the equatorial Pacific—IODP (Integrated 
Ocean Drilling Program) sites U1331, U1332, U1333, U1334 and U1335°”—as the 


Pacific endmember. The equatorial Pacific sites ranged from 2,500 to 4,500 m 
depth and 8°N to 2°S during the late Eocene to late Oligocene. The énq value 
for the Atlantic-Indian sectors of the Southern Ocean are compiled from ODP 
sites 689, 738, 757 and 1090 (refs 33, 38-42). The Nd isotope records used to draw 
the endmember fields in Fig. 2 are displayed in Extended Data Fig. 2. Information 
about the drill cores and Nd isotope data sources used to generate the endmember 
&na envelopes in Fig. 2, as well as the data sources for the local terrigenous inputs, 
are listed in Extended Data Table 3. The range of énq values carried by dissolved 
and particulate phases in the Murray River basin**“* is drawn on the right side of 
the figures. 

Bulk-sediment geochemistry. To aid our interpretation of Pacific-Indian com- 
munication, we used a bulk-sediment aluminium/titanium (Al/Ti) record from 
the East Tasman Plateau to distinguish between mafic (that is, Pacific) sediment 
sources (Al/Ti< 9) and Indian sources, with Al/Ti ratios closer to that of average 
continental crust (Al/Ti = 15.8)*°. A prominent shift from mafic sources to upper 
crustal values accompanies the Nd isotope shift and the timing of the northward 
migration of the South Tasman Rise margin north of the Oligocene polar front. 
Age models. Age models for ODP sites 1124, 1168 and 1172 are from refs 46-48. 
Subsidence models. The subsidence curves shown in Fig. 2a are based upon 
microfossil and facies analysis of the ODP Leg 189 sediment cores’’. As noted 
in ref. 13, there is a significant subsidence anomaly for the East Tasman Plateau, 
resulting from conflicting palaeodepth estimates from site 1172 and the Cascade 
Guyot'*“?*! (also called Cascade seamount in the literature). As shown in 
Extended Data Fig. 4, this discrepancy is ~1,900 m. 

An alternative interpretation is that shallow marine sediments below 356 m in 
site 1172 were first deposited on the top/flank of the Cascade Guyot and subse- 
quently transported to site 1172. We note that a 5-m section in ODP Site 1172 
(361 to 356 metres below sea level), interpreted as shallow marine facies by Stickley 
et al.'', has an up-section grain-size profile that is consistent with a classic Bouma 
sequence”, and may actually represent turbidite sequences shed from the sea- 
mount when it was still above wave base and/or sea level. This possibility was 
discussed in ref. 13. 

Given the interpretation that shallow marine sediments were transported by 
turbidites, we constructed the alternative subsidence model for ODP site 1172 
(shown in Fig. 2a) by adjusting the published curve from ref. 13. We deepen the 
published subsidence curve between 38 Ma and 34 Ma by 1,900 m to account for 
the transport of sediments from the Cascade Guyot to site 1172. There is a sudden 
change in facies in site 1172 at 357 m to fully offshore pelagic sedimentation, dated 
to 30.2 Ma (ref. 11). We interpret this transition as the cessation of turbidite 
transport to site 1172, possibly due to the subsidence of the Cascade Seamount 
beneath sea level and/or wave base. We use the published subsidence curve from 
ref. 13 for times younger than 30 Ma. For the poorly constrained period between 
34 Ma and 30 Ma, we linearly interpolate the subsidence. 

Our revised subsidence curve reconciles the available data from ODP site 1172 
and the Cascade Guyot. The East Tasman Plateau is located to the east of 
Tasmania, a passive margin formed when continental break-up occurred at 
~83 Ma between Australia and the Lord Howe Rise. Our revised subsidence curve 
is more consistent with the location of the East Tasman Plateau in such a tectonic 
setting, where subsidence rates some 50 million years after rifting are predicted to 
be low. Emplacement of the seamount yields thepotential for subsidence anom- 
alies in the late Eocene related to volcanism and subsequent crustal loading from 
the mass of the seamount. 


31. Boyle, E. A. Cadmium, zinc, copper and barium in foraminifera tests. Earth Planet. 
Sci. Lett. 53, 11-35 (1981). 

32. Boyle, E. A. & Keigwin, L. D. Comparison of Atlantic and Pacific paleochemical 
records for the last 215,000 years; changes in deep ocean circulation and 
chemical inventories. Earth Planet. Sci. Lett. 76, 135-150 (1985/86). 

33. Scher, H. D. & Delaney, M. L. Breaking the glass ceiling for high resolution Nd 
isotope records in early Cenozoic paleoceanography. Chem. Geol. 269, 329-338 
(2010). 

34. Chavagnac, V. et al. Towards the development of a fossil bone geochemical 
standard: an inter-laboratory study. Anal. Chim. Acta 599, 177-190 (2007). 

35. Ling, H. F. et al. Evolution of Nd and Pb isotopes in Central Pacific seawater from 
ferromanganese crusts. Earth Planet. Sci. Lett 146, 1-12 (1997). 

36. Ling, H.-F. etal. Differing controls over the Cenozoic Pb and Nd isotope evolution of 
deepwater in the central North Pacific Ocean. Earth Planet. Sci. Lett 232, 345-361 
(2005). 

37. Scher, H. Stacking PEAT; a neodymium isotope stack for the Paleogene equatorial 
Pacific. Rendiconti Soc. Geol. Ital. 31, 191-192 (2014). 

38. Scher, H. D. & Martin, E. E. Circulation in the Southern Ocean during the 
Paleogene inferred from neodymium isotopes. Earth Planet. Sci. Lett 228, 
391-405 (2004). 

39. Scher, H. D., Bohaty, S., Zachos, J. C. & Delaney, M. L. Two-stepping into the 
icehouse: East Antarctic weathering during progressive ice-sheet expansion at the 
Eocene-Oligocene Transition. Geology 39, 383-386 (2011). 


©2015 Macmillan Publishers Limited. All rights reserved 


40. 
41. 


42. 
43. 


44. 


45. 
46. 


47. 


Scher, H. D., Bohaty, S.M., Smith, B. & Munn, G. Isotopic interogation of a suspected 
late Eocene glaciation. Paleoceanography 29, 628-644 (2014). 

Martin, E. E. & Scher, H. A Nd isotopic study of southern sourced waters and 
Indonesian throughflow at intermediate depths in the Cenozoic Indian Ocean. 
Geochem. Geophys. Geosyst. 7, QO9NO2 (2006). 

Scher, H. D. & Martin, E. E. Timing and climatic consequences of the opening of 
Drake Passage. Science 312, 428-430 (2006). 

Goldstein, S. J. & Jacobsen, S. B. The Nd and Sr isotopic systematics of river-water 
dissolved material — implications for the sources of Sd and Sr in seawater. Chem. 
Geol. 66, 245-272 (1987). 

Goldstein, S. J. & Jacobsen, S. B. Nd and Sr isotopic systematics of river water 
suspended material; implications for crustal evolution. Earth Planet. Sci. Lett 87, 
249-265 (1988). 

Taylor, S. R. & McLennan, S. M. The Continental Crust: Its Composition and Evolution 
(Blackwell, 1985). 

Joseph, L. H., Rea, D. K.& van der Pluijm, B. A. Neogene history of the Deep Western 
Boundary Current at Rekohu sediment drift, Southwest Pacific (ODP Site 1124). 
Mar. Geol. 205, 185-206 (2004). 

Stickley, C. E. et a/. Late Cretaceous to Quaternary biomagnetostratigraphy of ODP 
Sites 1168, 1170, 1171, and 1172, Tasmanian Gateway. Proc. ODP Sci. Res. 
http://www-odp.tamu.edu/publications/189_SR/111/111.htm (2004). 


48. 


49. 


50. 


51. 


52. 


53. 


54. 


55. 


LETTER 


Fuller, M. & Touchard, Y. in The Cenozoic Southern Ocean. Tectonics, Sedimentation, 
and Climate Change between Australia and Antarctica (eds Exon, N., Kennett, J. P. & 
Malone, M.) 63-78 (American Geophysical Union, 2004). 

Lanyon, R., Varne, R. & Crawford, A. J. Tasmanian tertiary basalts, the Balleny 
plume, and opening of the Tasman sea (southwest Pacific-ocean). Geology 21, 
555-558 (1993). 

Quilty, P. G. Late Eocene foraminifers and palaeoenvironment, Cascade Seamount, 
southwest Pacific Ocean: implications for seamount subsidence and Australia- 
Antarctica Eocene correlation. Austral. J. Earth Sci. 48, 633-641 (2001). 

Quilty, P. G. Eocene and younger biostratigraphy and lithofacies of the Cascade 
Seamount, East Tasman Plateau, southwest Pacific Ocean. Austral. J. Earth Sci. 44, 
655-665 (1997). 

Bouma, A. H. Sedimentology of Some Flysch Deposits: A Graphic Approach to Facies 
Interpretation (Elsevier, 1962). 

Miller, R. D., Sdrolias, M., Gaina, C. & Roest, W. R. Age, spreading rates, and 
spreading asymmetry of the world’s ocean crust. Geochem. Geophys. Geosyst. 9, 
Q04006 (2008). 

Torsvik, T. H. et al. Phanerozoic polar wander, palaeogeography and dynamics. 
Earth Sci. Rev. 114, 325-368 (2012). 

Torsvik, T. H., Steinberger, B., Cocks, L.R. M. & Burke, K. Longitude: linking Earth’s 
ancient surface to its deep interior. Earth Planet. Sci. Lett 276, 273-282 (2008). 


©2015 Macmillan Publishers Limited. All rights reserved 


LETTER 


-8000 -6000 4000 -2000 o 8000 -6000 ~4000 2000 0 


-8000 -6000 4000 2000 0) 8000 ~8000 ~4000 2000 0 


8000 6000 4000 2000 0 8000 8000 4000 2000 0 


8000 ‘6000 4000 -2000 0 8000 6000 4000 2000 o 
Extended Data Figure 1 | Tectonic reconstructions of the Tasmanian meridional section drawn through the narrowest portion of the gateway (red 
Gateway. Present-day coastlines (black), continental shelves (white), and arrow in each reconstruction). Palaeocurrent direction (indicated by black 


range of the innermost and outermost continent-ocean boundaries (COBs) for arrows) is based on the Nd isotope data from sites 1168 and 1172 (red circles), 
the conjugate margins of the Tasmanian Gateway (grey) are shown for time which indicate westward or eastward flow, depending on the eng values at the 


slices from the late Eocene (a, b) to the early Oligocene (c-h). The sites. The éyq values listed on the diagrams are the average values for the 
palaeobathymetry grid is based on ref. 53. Palaeolatitudes of the innermost and __ time slices shown. Numbers in parentheses show the standard error (see 
outermost COB locations in Fig. 2 are based on their position along a Source Data). 
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Extended Data Figure 2 | Comparison of absolute reference frames. The 
relative position of the northern conjugate margin of the Tasmanian Gateway 
relative to the Oligocene polar front was tested using three reference frames: 
a, ref. 54; b, ref. 55; and, ref. 18. Although the absolute palaeolatitudes of these 
features differ between these reference frames, the timing of the northern 
margin transit across the Oligocene polar front is not affected by the choice 
of reference frame because the reconstruction of the Oligocene polar front 

is reconstructed using the same rotation data as the margins. In each reference 
frame, the northern margin of the gateway migrates across the Oligocene polar 
front between 29 and 30 Ma. Light blue bands correspond to the maximum 


gateway width based on geophysically determined positions of the innermost 
COBs on the South Tasman Rise and Antarctica’, selected along the narrowest 
meridional transect of the gateway (Extended Data Fig. 1). Dark blue bands 
show the minimum gateway widths based on the outermost geophysically 
determined COBs. The Oligocene position of the polar front (yellow band) was 
derived from microfossil assemblage data in deep-sea sediment cores"’; these 
data were incorporated into the plate reconstruction used in this study. The 
grey band centred at 30 Ma highlights the age range of the South Tasman Rise 
crossing the Oligocene polar front. 
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Extended Data Figure 3 | Water-mass endmembers used for this study. Nd (see Extended Data Table 3 for reference list). b, Range of the endmember 


isotope records used to demarcate Pacific (red) and Indian—-Atlantic waters for the Pacific and Indian—Atlantic. Envelope around the Pacific 
(purple) waters over the study interval. a, Individual Nd isotope records Nd isotope stack is instrumental uncertainty (20). 
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Extended Data Figure 4 | Anomalous subsidence histories of Cascade Guyot —_ marine sediments are currently ~2,900 m below sea level at ODP site 1172, 
and East Tasman Plateau. Cascade Guyot subsided at least 1,000 m since the _ suggesting that the East Tasman Plateau subsided three times more than the 
late Eocene, judging by the sediments dredged high on the flank. Shallow Guyot over the same interval. 
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Extended Data Figure 5 | Bulk sediment geochemistry. a, Detrital Al/Ti 
ratios from ODP site 1172 (red triangles). b, Fossil fish tooth Nd isotope records 
from the Western Tasmanian Margin (WTM; site 1168; blue), the East 
Tasman Plateau (ETP; site 1172; grey), and the Hikurangi Plateau (HP; site 
1124; red). Error envelopes represent instrumental uncertainty (2c) based on 
replicate Nd isotope analyses of the JNdi-1 Nd isotope standard. Error bars 
(standard errors) are shown for samples where the standard error of the 
measurement exceeds 2c instrumental uncertainty. The Pacific and Indian- 
Atlantic xq envelopes are based on fossil fish tooth Nd isotope records from the 
equatorial Pacific, southern Kerguelen Plateau, Ninetyeast ridge, and Maud 
Rise (Extended Data Fig. 3). The range of xq values of weathering products in 
the Murray River is shown on the right axis. c, The palaeolatitudes of the 
conjugate margins of the Tasmanian Gateway outline the width and position of 


the gateway during its progressive opening. Light blue bands correspond to the 
maximum gateway width based on geophysically determined positions of the 
innermost COBs on the South Tasman Rise and Antarctica®, selected along 
the narrowest meridional transect of the gateway (Extended Data Fig. 1). The 
dark blue band shows the minimum gateway widths based on the outermost 
geophysically determined COBs. Error bars (standard errors) on the northern 
and southern boundaries of the gateway reflect uncertainty in the positions 
of the conjugate margins based on the 95% confidence intervals from the 
rotations’. The Oligocene position of the polar front (yellow band) was derived 
from microfossil assemblage data in deep-sea sediment cores’’, which were 
incorporated into the plate reconstruction used in this study. The grey band 
centred at 30 Ma highlights the age range of the South Tasman Rise crossing 
the Oligocene polar front. 
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Extended Data Table 1 | Lithological information from DSDP cores used to restore the Oligocene polar front, based on data in ref. 19 


DSDP Site 
267 
268 
269 
274 
278 
280 


281 


lithology (major, minor) 

carbonate 

terrigenous, minor opal 

terrigenous 

terrigenous and opal 

Opal, minor carbonate 

Opal, minor terrigenous and carbonate 


terrigenous/carbonate 


relative surface water 
temperature interpretation 


warm 
cold 
cool 
cold 
cold 
cold 


cool 
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Extended Data Table 2 | Ocean Drilling Program site information for this study 


Present 


Site name; province, Ocean Basin 


ODP Site 1124; Rehoku Drift, Pacific 


ODP Site 1168; west Tasman margin, Indian 


Latitude, Longitude 


39°29.897'S, 
176°31.893'W 


42°36.568'S, 
144°24.761'E 


43°57.571'S, 
149°55.706'E 


depth (m) 


3966.8 


2463.6 


2621.9 


Age model applied 


Joseph et al., 2004 


Stickley et al., 2004 


ODP Site 1172; east Tasman Plateau, Pacific 


References are Joseph et al.*®, Stickley et al.4” and Fuller and Touchard*®. 
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Extended Data Table 3 | Details of the ODP sites used as the Pacific and Indian—-Atlantic endmembers in Fig. 1 and Extended Data Table 2 


Site/River 


ODP Site 689 

ODP Site 738 

ODP Site 757 

ODP Site 1090 

IODP sites U1331, 
U1332, U1333, U1334, 
U1335 


Murray River 


*Palaeodepth at 34 Ma. 


Location Latitude 
Maud Rise, Atlantic 


sector Southern Ocean cle 


Southern Kerguelen 
Plateau, Indian sector 62.7°S 
Southern Ocean 


Ninetyeast Ridge, 


Indian Ocean iS 
Aghulas Ridge, south ° 
Atlantic Ocean oy 
Equatorial Pacific 5-12°N 


Murray Bridge, SA na 


Longitude 


5.1°W 


82.8°E 


88.2°E 


8.9°E 


126 - 142°W 


na 


Paleodepth (m)? 


1500 


1750 


1000-1500 


3400 


2500-4500 


na 
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Nd isotope data source 


Scher and Martin, 2004 


Scher et al., 2011 


Martin and Scher, 2006 


Scher and Martin, 2006; 2008 


Scher, 2014 


Goldstein and Jacobsen, 1987; 
Goldstein and Jacobsen, 1988 
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A Middle Triassic stem-turtle and the evolution of the 


turtle body plan 


Rainer R. Schoch! & Hans-Dieter Sues” 


The origin and early evolution of turtles have long been major 
contentious issues in vertebrate zoology’ "'. This is due to conflict- 
ing character evidence from molecules and morphology and a 
lack of transitional fossils from the critical time interval. The 
~220-million-year-old stem-turtle Odontochelys from China’ 
has a partly formed shell and many turtle-like features in its post- 
cranial skeleton. Unlike the 214-million-year-old Proganochelys 
from Germany and Thailand, it retains marginal teeth and lacks 
a carapace. Odontochelys is separated by a large temporal gap from 
the ~260-million-year-old Eunotosaurus from South Africa, which 
has been hypothesized as the earliest stem-turtle**. Here we report 
a new reptile, Pappochelys, that is structurally and chronologically 
intermediate between Eunotosaurus and Odontochelys and dates 
from the Middle Triassic period (~240 million years ago). The 
three taxa share anteroposteriorly broad trunk ribs that are 
T-shaped in cross-section and bear sculpturing, elongate dorsal 
vertebrae, and modified limb girdles. Pappochelys closely resem- 
bles Odontochelys in various features of the limb girdles. Unlike 
Odontochelys, it has a cuirass of robust paired gastralia in place of 
a plastron. Pappochelys provides new evidence that the plastron 
partly formed through serial fusion of gastralia*’’. Its skull has 
small upper and ventrally open lower temporal fenestrae, support- 
ing the hypothesis of diapsid affinities of turtles*”'®'*”. 

Turtles are readily diagnosed by the possession of a bony shell con- 
sisting of a dorsal carapace and a ventral plastron, which are linked by 
bony bridges on either side. In recent years embryological studies on 
extant turtles’’ and recognition of stem-turtles**’” have provided evid- 
ence about the stepwise acquisition of features of this unique body plan. 
The discovery of a new stem-turtle from the Middle Triassic (Ladinian) 
of Germany sheds new light on this evolutionary transition as well as on 
the long-contentious relationships of turtles to other amniotes. 


Reptilia Laurenti, 1768 
Pan-Testudines Joyce, Parham and Gauthier, 2004 (ref. 16) 
Pappochelys gen. nov. 


Etymology. Pappos (Greek): grandfather; chelys (Greek): turtle. Type 
species. Pappochelys rosinae. 


Pappochelys rosinae sp. nov. 


Etymology. In honour of I. Rosin, who prepared key specimens of the 
new taxon. 

Holotype. Staatliches Museum fiir Naturkunde Stuttgart, SMNS 91360, 
incomplete, partly articulated postcranial skeleton (Fig. 1a, b). 
Referred material. SMNS 90013, disarticulated skeleton with incom- 
plete skull (Fig. 1c, d), and 18 additional specimens. See Supplementary 
Information for details. 

Type locality. Schumann quarry, Eschenau, Vellberg municipality, 
Baden-Wiirttemberg, Germany. 

Type horizon. Top of Untere Graue Mergel, Lower Keuper (Erfurt 
Formation); late Middle Triassic (Ladinian: Longobardian). The fossils 
occur in a 5- to 15-cm-thick layer of dark grey lacustrine claystone, along 


with fishes, temnospondyl stem-amphibians, and mostly terrestrial 
diapsid reptiles'’. Despite extensive collecting in Lower Keuper strata 
for the past two centuries, not a single diagnostic bone of the new reptile 
was discovered until recently. Since 2006, the aforementioned fossils 
referable to this taxon have been recovered from the type locality. 

Diagnosis. Small (estimated adult length ~20 cm); skull proportio- 
nately small, with short, deep temporal region; parietal with distinct 
occipital flange; jaw bones bearing teeth; squamosal and parietal 
bounding much of upper temporal fenestra; lower temporal opening 


are : : — rib 


Figure 1 | P. rosinae. a, b, Articulated partial postcranial skeleton (SMNS 
91360, holotype); ¢, d, disarticulated skeleton with incomplete skull (SMNS 
90013). Photographs with explanatory outline drawings. Trunk ribs 
highlighted in black. Abbreviations: co, coracoid; d, dentary; dv, dorsal vertebra; 
f, frontal; fe, femur; il, ilium; j, jugal; p, parietal; pu, pubis; q, quadrate; sq, 
squamosal; sv, sacral vertebra; ti, tibia; tv, tail vertebra; question mark, 
unidentified cranial bone. 
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open ventrally; trunk ribs anteroposteriorly broad, T-shaped in cross- 
section and with dorsal sculpturing; gastralia paired, robust, with 
ridged external surface; whip-like tail comprising some 50% of total 
length; scapula with tall, straight dorsal process and small ‘acromial’ 
flange; coracoid plate-like; ilium with long postacetabular process; 
pubis with thyroid foramen and distinct lateral process. 

All specimens of the new taxon were mechanically prepared. 
Histological work on fossil bones employed standard techniques used 
for making petrographic thin-sections. 

The skull of Pappochelys (Fig. 2c) has a large orbit and short, pointed 
snout and is triangular in dorsal view. Its broad cheek region has a 
small rounded upper and a ventrally open lower temporal fenestra. 
The configuration of the bones in the temporal region is consistent 
with that in diapsid reptiles. 

The premaxilla, maxilla, and dentary bear teeth. The teeth are peg- 
like in the maxilla (Fig. 2a, b) and slightly inclined posteriorly in the 
anterior part of the dentary (Fig. 2i). In both upper and lower jaws, 
the more posterior teeth are smaller and somewhat more robust. The 
premaxilla bears four teeth, the maxilla up to 17, and the dentary at 
least 29. Tooth implantation appears to be subthecodont. The short 
maxilla has a large, posterodorsally directed facial process and a short 
anterior ramus. Its posterior process decreases in height posteriorly. 
The frontal (Extended Data Fig. 1g) is longer than the nasal and 
parietal. The parietal (Fig. 2d, Extended Data Fig. 1c, d) has a short 
posterolateral wing and distinctly offset occipital flange. There is no 
trace of a pineal foramen. The postorbital (Fig. 2e and Extended Data 
Fig. le) is short anteroposteriorly, with a rounded posterior and a long 
ventral process contacting the posterior margin of the dorsal process of 
the jugal. The postfrontal (Extended Data Fig. 1f) is triangular. The 
slender jugal (Fig. 2h and Extended Data Fig. 2) has an elongate, 
anterodorsally curving anterior process, a tapering dorsal process, 


Figure 2 | Skull elements of P. rosinae (digitally extracted from surrounding 
matrix). a, b, Left maxilla (SMNS 91431; a, labial view; b, lingual view of 
marked section); c, skull reconstruction in lateral view, with preserved elements 
indicated in grey; d, right parietal (SMNS 91356); e, right postorbital (SMNS 
91356); f, right squamosal (SMNS 90013); g, right quadrate (SMNS 90013); 
h, left jugal (SMNS 92066, broken into two segments and partly preserved as an 
impression); i, left dentary (SMNS 92066). 
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and a short posterior process that indicates a ventrally largely open 
lower temporal fenestra. The squamosal (Fig. 2f) has a long ventral 
process, tapering postorbital ramus, rounded posterior buttress, and 
median parietal process. The massive quadrate (Fig. 2g) has a slightly 
concave posterior margin. The palate is not clearly exposed in any 
available specimen. The dentary (Fig. 2i and Extended Data Fig. 2) is 
slender and curves anterodorsally. No other mandibular bones have 
been identified so far. 

Cervical vertebrae (Fig. 3g) are elongate and low, with extensive 
postzygapophyses and low neural spines. The number of presacral 
vertebrae and relative lengths of the neck and trunk remain unknown. 
Dorsal vertebrae (Fig. 3h, i) have long cylindrical centra with nearly 
vertical rib facets at about midlength. The neural arch is fused to the 
centrum in adults, and the neural spine is low. There is no trace of 
neurals. Although the trunk region is disarticulated in all available 
specimens, the maximum number of trunk vertebrae did not exceed 
nine. The anteroposteriorly broad trunk ribs (Fig. 3a-d) each bear 
slightly asymmetrical anterior and posterior flanges and are distinctly 
T-shaped in cross-section. The confluent rib heads have a figure-eight 
shape in end view. The distal ends of the ribs are tapered. Undistorted 


Figure 3 | Postcranial elements of P. rosinae (digitally extracted from 
surrounding matrix). a, Trunk rib in dorsal view (SMNS 92067); b, trunk rib 
in anterior view (SMNS 92068); c, d, trunk ribs in ventral view (c, SMNS 92063; 
d, SMNS 91360); e, gastralium (SMNS 91360); f, lateral ends of two fused 
gastralia (SMNS 91363); g, cervical vertebra (SMNS 91360); h, anterior dorsal 
vertebra (SMNS 91356); i, mid-dorsal vertebra (SMNS 91360); j, right scapula 
(SMNS 92044); k, incomplete interclavicle (SMNS 91895); 1, proximal portion 
of right scapula (SMNS 91895); m, right coracoid (SMNS 91360); n, left 
humerus (SMNS 91113); 0, right ilium (SMNS 91895); p, left pubis (SMNS 
91360); q, right femur (SMNS 91356); r, left tibia (SMNS 91360). 
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ribs curve laterally, with the posterior flange being more extensive than 
the anterior one. Large ribs bear pronounced sculpturing on the dorsal 
surface, comprising ridges that may bear tubercles in places (Fig. 3a). 
This sculpturing suggests an intradermal origin of these structures’. 
Thin-sections of trunk ribs (Extended Data Fig. 4) show that they are 
rather compact and closely resemble those of Eunotosaurus in their 
histological structure*. The whip-like tail of Pappochelys comprises 
over 23 vertebrae with low neural arches and long, cylindrical centra 
(Fig. la, b). 

The trunk has large paired gastralia (Fig. 3e and Extended Data Fig. 3). 
Individual elements are thick, with ridged surfaces and tapered ends. 
Successive gastralia occasionally fused to each other, as suggested by 
several particularly robust elements with forked distal ends (Fig. 3f). In 
ventral view, the anterior gastralia extend anterolaterally, whereas the 
reverse obtains on the posterior gastralia. None of the available fossils 
preserves undisturbed pairs of gastralia. 

The scapula (Fig. 3), 1) has a tall, slender dorsal process, which was 
probably aligned vertically, and a short ‘acromial’ process. Its glenoid 
facet faces posterolaterally. The plate-like coracoid (Fig. 3m) has an 
anterolateral glenoid facet. The interclavicle (Fig. 3k) has a rounded 
anterior process, posterolaterally extending lateral processes contact- 
ing the clavicles, and a long, tapering posterior process. The robust 
humerus (Fig. 3n) has expanded articular ends. Radius and ulna are 
slender, and the manus has slender digits with long, narrow unguals. 

The pelvis closely resembles those of Odontochelys’? and Progano- 
chelys’*. However, the ischium remains separate from the pubis. The 
ilium (Fig. 30) has a long postacetabular process with a strongly 
striated lateral surface and a straight dorsal margin. The pubis 
(Fig. 3p) has an oval thyroid foramen and a distinct lateral process. 
There is no trace of a hypoischium. 

The hindlimb is only slightly longer than the forelimb. The S-shaped 
femur (Fig. 3q) has a distinct internal trochanter and an offset head. 
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Figure 4 | Phylogenetic position of Pan-Testudines including Pappochelys 
among Amniota based on maximum parsimony analysis of the data matrix 
in ref. 5. Numbers at nodes indicate bootstrap percentages (only those >50) 
for each node. The tree has a length of 759 steps, a consistency index of 0.32, and 
a rescaled consistency index of 0.21. 
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The robust tibia (Fig. 3r) is much shorter than the femur. The pes is 
slightly larger than the manus. 

Morphologically Pappochelys represents an intermediate stage 
between Eunotosaurus**? and Odontochelys’*. It shares the T-shaped 
cross-sectional outline of the broad trunk ribs with both taxa. 
Saurosphargid reptiles also have broad trunk ribs**”’, but these differ 
from those of stem-turtles in being flatter (with the shaft not offset), 
straighter, single-headed, and uniform throughout the trunk region. By 
contrast, the thoracic ribs of Pappochelys and Odontochelys vary in 
shape with respect to their position, and some ribs have asymmetrical 
anterior and posterior flanges. Unlike Eunotosaurus and Pappochelys, 
Odontochelys has neurals and supports the hypothesis that the turtle 
carapace developed by outgrowth of intramembranous bone from the 
periosteum of the ribs and neural spines. Odontochelys also has a fully 
developed plastron whereas Eunotosaurus has paired gastralia*, as does 
Pappochelys. The gastralia, together with ventral elements of the shoul- 
der girdle, formed the plastron in turtles*"*. Although Pappochelys lacks 
a plastron, the typically thickened and sometimes two-ended gastralia 
indicate increased ossification in the ventral region and incipient fusion 
of successive elements in this taxon, supporting the hypothesis that the 
turtle plastron partly formed through their co-ossification*’’. The lateral 
ends of the plastral elements in Odontochelys form spine-like projec- 
tions’ that resemble the distal ends of the gastralia in Pappochelys both 
in their alignment and in their striated surface texture. 

The scapula and coracoid of Pappochelys closely resemble those of 
Odontochelys and the more derived Proganochelys. Compared with 
Proganochelys, the ‘acromial’ process of the scapula is short and forms 
a rounded anteromedial edge and the coracoid is not as expanded. 
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Figure 5 | Early evolution of the turtle body plan. a, Restoration of the 
skeleton of Pappochelys in lateral view (as yet unknown elements in white; 
preserved bones in grey; trunk ribs and gastralia highlighted in black); 

b, successive appearance of key features of the turtle body plan; ¢, plastron of 
Odontochelys and reconstructed ventral bones of the shoulder girdle and 
gastralia set in Pappochelys (elements of the shoulder girdle and their 
homologues are indicated in a darker shade of grey). 
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The pubis of Pappochelys has a distinct lateral process, as in 
Odontochelys and more derived turtles, where it contacts the plastron. 
The ilium closely resembles that of Proganochelys in outline. 

The presence of two temporal openings on either side of the cra- 
nium in Pappochelys supports the hypothesis of diapsid affinities for 
turtles*’*’°. The configuration of the squamosal, postorbital, and pari- 
etal closely resembles the condition in diapsid reptiles, whereas the 
small size of the upper temporal fenestra may suggest incipient reduc- 
tion of that opening. A phylogenetic position of turtles within Diapsida 
has been consistently recovered by phylogenetic analyses based on 
molecular data (although most of the latter specifically place turtles 
with or close to archosaurs)’*° but also by some morphologically 
based studies*’*"*. Our phylogenetic analysis (Fig. 4 and Extended 
Data Figs 5 and 6; for detailed data see Supplementary Information) 
recovered Pan-Testudines as the sister-taxon to Sauropterygia, a clade 
of marine saurian reptiles, and this grouping as the sister-group to 
Lepidosauriformes as previously suggested in refs 2 and 14. 
Traditionally, palaeontologists interpreted turtles as primarily lacking 
temporal openings and accordingly assigned them a basal position 
among reptiles®. Similarly, the skull of Eunotosaurus was long consid- 
ered ‘anapsid’ but, like Pappochelys, it has ventrally open lower tem- 
poral openings’. Furthermore, new research indicates that 
Eunotosaurus has upper temporal openings concealed by large supra- 
temporals”. 

Pappochelys is the most common reptile in the Vellberg lake deposit 
known so far, and is represented by various growth stages, which 
suggests that it either lived along the lakeshore or frequently entered 
the lake. Under a scenario that the turtle shell initially evolved in an 
aquatic setting’, the plastron may have first developed as protection 
and ‘bone ballast’ for controlling buoyancy®. The thick gastralia and 
ribs in Pappochelys are consistent with aquatic or semi-aquatic habits. 
Although the oldest fully shelled turtles were probably terrestrial**”, 
Odontochelys apparently lived in deltaic or lagoonal settings along a 
coastline*”. 

In summary, Pappochelys provides a new stage in the evolution of 
the turtle body plan (Fig. 5) and critical evidence for the diapsid rela- 
tionships of turtles. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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Extended Data Figure 1 | Cranial material of P. rosinae. a, b, Photograph dorsal view; e, left postorbital; f, left postfrontal; g, left frontal. Abbreviations: 
and explanatory outline drawing of partial skull and postcranial skeleton of dy, dorsal vertebra; f, frontal; ga, gastralium; j, jugal; mt, metatarsal; n, nasal; p, 
P. rosinae (SMNS 91356); ¢, left parietal in ventral view; d, right parietal in parietal; ph, phalanx; po, postorbital; pof, postfrontal; ti, tibia; tv, tail vertebra. 
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gas d 
j 
5mm d 
Extended Data Figure 2 | Skeletal remains of a very small individual of (SMNS 92066). Bones of the skull are shown in a darker shade of grey. 
P. rosinae. a, b, Photograph (a) and explanatory outline drawing (b) of Abbreviations: d, dentary; dv, dorsal vertebra; fe, femur; gas, gastralia; j, jugal; 
associated skeletal remains of a very small individual of P. rosinae prf, prefrontal; pt?, possible pterygoid; ti, tibia. 
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Extended Data Figure 3 | Gastralia of P. rosinae. a, b, Photograph (a) and explanatory outline (b) of a set of gastralia elements and fragments of two trunk ribs 
(black) that are part of the incomplete, partly articulated postcranial skeleton SMNS 91360. 
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Extended Data Figure 4 | Transverse section through the broadened shaft of a left trunk rib. 
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Extended Data Figure 5 | Tree illustrating hypothesis of turtle relationships based on the Tree Analysis using New Technology (TNT) program. Individual 
nodes are numbered. For additional information refer to ‘Phylogenetic analysis’ section in Supplementary Information. 
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Sparse whole-genome sequencing identifies two loci 
for major depressive disorder 


CONVERGE consortium* 


Major depressive disorder (MDD), one of the most frequently 
encountered forms of mental illness and a leading cause of disability 
worldwide’, poses a major challenge to genetic analysis. To date, 
no robustly replicated genetic loci have been identified’, despite 
analysis of more than 9,000 cases’. Here, using low-coverage 
whole-genome sequencing of 5,303 Chinese women with recurrent 
MDD selected to reduce phenotypic heterogeneity, and 5,337 con- 
trols screened to exclude MDD, we identified, and subsequently 
replicated in an independent sample, two loci contributing 
to risk of MDD on chromosome 10: one near the SIRT1 gene 
(P= 2.53 X 107°), the other in an intron of the LHPP gene 
(P = 6.45 x 10° '”). Analysis of 4,509 cases with a severe subtype 
of MDD, melancholia, yielded an increased genetic signal at the 
SIRT1 locus. We attribute our success to the recruitment of rela- 
tively homogeneous cases with severe illness. 

The existence and number of subtypes of depression have been 
debated over the past 100 years. The current consensus is that depres- 
sion may be a collection of partly distinct diseases, with overlapping 
causal pathways. This aetiologic heterogeneity might therefore sub- 
stantially reduce the power of genetic association studies, and hence 
explain the failure to find genetic risk loci’. For example, there may be 
cases of MDD of largely environmental origin whose presence reduces 
the power to detect genetic effects. Also, genetic risk factors for mild 
depressive syndromes may not be entirely the same as those for more 
severe cases’. 

For these reasons, we investigated the genetic basis of MDD in subjects 
for whom known sources of phenotypic and genetic heterogeneity were 
minimized and known risk factors documented. The CONVERGE 
(China, Oxford and Virginia Commonwealth University Experimental 
Research on Genetic Epidemiology) consortium recruited 11,670 Han 
Chinese women through a collaboration involving 58 hospitals in China. 
We studied only women because about 45% of the genetic liability to 
MDD is not shared between sexes*®. In an attempt to obtain severe 
cases of MDD, we recruited only recurrent cases (mean number of 
episodes was 5.6). 

We used low-coverage sequencing to genotype our sample’. Whole- 
genome sequences were acquired to a mean depth of 1.7 (95% con- 
fidence intervals (CIs) 0.7-4.3) per individual, from which 32,781,340 
SNP sites were identified. After applying stringent quality controls 
(Methods), we obtained 10,640 samples (5,303 cases of MDD, 5,337 
controls) and 6,242,619 SNPs for inclusion in genome-wide asso- 
ciation studies (GWAS). We compared genotypes from the low-cov- 
erage sequencing to genotypes called with 10X coverage sequence and 
to genotypes called from genotyping arrays and a mass spectrometer 
platform. The mean percentage concordance between genotypes from 
nine individuals with both low- and 10 coverage across all sites was 
98.1% (Supplementary Table 1). We compared imputed genotypes 
to those acquired for 72 individuals using an array and to 21 SNPs 
genotyped on all individuals with the MassARRAY system mass 
spectrometer (Supplementary Notes). Overall concordance was 
98.0% (Supplementary Tables 2 and 3). 


Genetic association analysis was carried out with a linear mixed model 
with a genetic relatedness matrix (GRM) as a random effect and prin- 
cipal components from eigen-decomposition of the GRM as fixed effect 
covariates (Methods, Supplementary Notes)*’. Fig. la and Extended 
Data Fig. 1 show the Manhattan and quantile-quantile plots, respect- 
ively, for this analysis. The genomic control inflation factor (A, the ratio 
of the observed median y* to that expected by chance) for association 
with MDD was 1.070 (for common SNPs, minor allele frequency (MAF) 
>2%, 2 = 1.074). The adjusted measure for sample size to that of 1,000 
cases and 1,000 controls (A;000) was 1.013. 

Two loci exceeded genome-wide significance in association with 
MDD: one 5’ to the sirtuinl (SIRT1) gene on chromosome 10 (SNP = 
1s12415800, chromosome 10:69624180, MAF = 45.2%, P = 1.92 x 10° 
Fig. 1b), and the other in an intron of the phospholysine phospho- 
histidine inorganic pyrophosphate phosphatase (LHPP) gene 
(SNP = rs35936514, chromosome 10:126244970, MAF = 26.0%, 
P=1.27X10 8, Fig. 1c). All SNPs with P values of association 
<10-° with MDD are listed in Supplementary Table 4. 

We checked the accuracy of the imputed genotypes at 12 SNPs with 
P<1X10 °, by re-genotyping the CONVERGE samples using a 
MassARRAY system mass spectrometer, thereby confirming their 
association with MDD. Extended Data Table 1 shows that the correla- 
tion between the two assays was high (mean r° = 0.984), and the odds 
ratios for the two genome-wide significant SNPs assessed by the two 
methods were almost identical, with highly overlapping confidence 
intervals (rs12415800 odds ratios: 1.167 versus 1.167; rs35936514 odds 
ratios: 0.845 versus 0.842). 

We replicated the associations by genotyping the same 12 SNPs ina 
separate Han Chinese cohort of 3,231 cases with recurrent MDD, and 
3,186 controls (both sexes). Two SNPs at the peaks of association for 
SIRT1 and LHPP loci (rs12415800 and rs35936514, respectively) for 
MDD in the CONVERGE samples were significantly associated with 
MDD (Table 1). Analysis of the combined samples gave P values for 
association with MDD at these two SNPs of 2.5310 '° and 
6.45 X 10° |”, respectively. Extended Data Table 2 shows the genotype 
distribution and P values for tests of violation of the Hardy-Weinberg 
equilibrium in both the CONVERGE samples and the replication 
cohort at both SNPs. 

Comparison with results from the Psychiatric Genomics Con- 
sortium (PGC) mega-analysis of European studies’ failed to provide 
robust replication for our top SNPs (Extended Data Fig. 2 and 
Extended Data Table 3). However, the proportion of associations in 
the same direction in the two studies exceeded expectations due to 
chance (P< 0.001), and polygenic risk scores from the PGC mega- 
analysis applied to the CONVERGE samples were of significant 
(P < 0.01) but limited predictive value, accounting for 0.1% of MDD 
risk in the CONVERGE cohort (Extended Data Table 4). It is unclear 
to what extent differences in sample ascertainment, ethnicity, or other 
factors contribute to the failure to replicate genetic effects in the PGC 
sample. Notably, variants at our most strongly associated loci are 
much rarer in European populations, where rs12415800 (SIRT1) 


*A full list of authors and affiliations appears at the end of the paper. 
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Figure 1 | Two loci associated with MDD in the 
CONVERGE sample. a, Manhattan plot of 
genome-wide association for MDD. b, Association 
at the SIRT1 region on chromosome 10 at 69.6 
megabases (Mb). c, Association at the LHPP gene 
on chromosome 10 at position 126.2 Mb. For 

b and c The —logio(P value) of imputed SNPs 
associated with MDD is shown on the left y axis. 
The recombination rates expressed in 
centimorgans (cM) per Mb (NCBI Build GRCh37; 
light blue lines), are shown on the right y axis. 
Position in Mb is on the x axis. Linkage 
disequilibrium of each SNP with the top SNP, 
displayed as a large purple diamond, is indicated by 
its colour. The plots were drawn using 
LocusZoom”. 
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and rs35936514 (LHPP) have frequencies of 3% and 8% respectively, 
compared to 45% and 26% in the CONVERGE cohort. 

We considered whether successful mapping of MDD in the 
CONVERGE samples was attributable to the recruitment of a severe, 
more genetically determined form of the disease. We tested that 
hypothesis by looking within the CONVERGE cohort at a particularly 
severe, and more heritable form of MDD: melancholia’. Prior 
research has suggested that MDD patients with melancholia have 
more impairing, recurrent episodes and that risk for MDD is higher 
in the co-twins of probands with the melancholic subtype’! than in 
those with non-melancholic MDD. This increase is greater in mono- 
zygotic than dizygotic twin pairs’, as would be expected if the subtype 
were associated with greater genetic risk. 

In the CONVERGE cohort, 85% of cases met the DSM-IV criteria 
for melancholia’*. We searched for a genetic association in 9,846 sam- 
ples (4,509 cases and 5,337 controls) and identified the same two loci 
that exceeded genome-wide significance on chromosome 10. The 


genomic control inflation factor 4 for melancholia was 1.069, and 
Aio00 Was 1.014. Even though the sample for melancholia was smaller 
than for MDD, at the SIRT1 locus the significance of association 
was two orders of magnitude greater than for MDD (top SNP = 
rs80309727, chromosome 10:69617347, MAF = 45.2%, P= 2.95 X 
10 1°), Extended Data Fig. 3 shows the Manhattan plot, quantile- 
quantile plot and detailed views of the SIRT1 locus associated with 
melancholia. All SNPs with P values of association <10-° with mel- 
ancholia are listed in Supplementary Table 5. To determine whether 
the increased association might have arisen by chance, we generated 
an empirical distribution of odds ratios by randomly selecting 4,509 
cases from the total set and re-analysing the association with each of 
the genome-wide significant variants. We found that the observed 
value lay on the 98.8th percentile at the SIRTI locus, but at the 
61.6" percentile at the LHPP locus (Extended Data Fig. 4). 

Our results indicate that, as others have suggested”’, obtaining low- 
sequence coverage of a large number of individuals can be an effective 
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Table 1 | Genetic association between MDD and 12 variants in the CONVERGE cohort and a replication sample 


SNP CONVERGE (n= 10,640) Replication (n = 6,417) Joint (n = 17,057) 

Chr. Pos. RSID Ref. Alt. Freq. Info. OR s.e. P OR s.e. P OR s.e. P 

1 11493832 = rs2922240 T C 0385 1.018 1.141 0.028 280x10-° 0.949 0.037 154x107! 070 0.022 246x103 

1 175151950 = rs3766688 T C 0.394 1.003 0.875 0.028 183x10° 0.991 0.037 815x107! 0.918 0.022 1.34x10~4 

1 228052027 rs57047840 A G 0284 0970 1.138 0.031 464x10° 1.001 0.041 990x107! 088 0.025 5.57x10% 

5 9161674 rs55713588 A G 0.096 0.893 .278 0.050 604x10’ 1.054 0.062 393x101! 042 0.035 208x101 

6 4386107 rs55800092  C T 0.151 1.001 0824 0.039 1.35x10° 0.962 0.052 449x107! 0876 0.031 1.82x10°° 

10 69624180  rs12415800 G A 0.452 0.992 1.164 0.028 1.92x10-® 1.130 0.036 7.71x107* 1.150 0.022 2.37x1071° 
10 126244970 rs35936514 C T 0.260 0.993 0.839 0.032 1.27x10-® 0.838 0.041 1.68x10-5 0.842 0.025 6.43x10~1? 
13. 107659212 +rs61967003 C T 0.017 0.999 645 0.109 670x10° 0.788 0150 1.11%x10+ 277 0.087 481 x1073 

14 66833851 rs17827252 C G 0463 1.011 0.887 0.028 144x10°° 0.962 0.041 341x101 0907 0.023 2.20x10° 

19 34493757 ~=—rs11880240 =6C G 0.068 1.019 291 0.055 802x10° 1.048 0.072 512x107 184 0.043 9.15x10°° 

Xx 24656658 ~—srs1921918 A G 0.721 0.995 0.883 0.031 3.22x10°° 0.994 0.047 9.01x107! 0.917 0.026 1.09x10-° 

Xx 25011374 ~=-rs11573525 C T 0.260 0.971 160 0.032 586x107 1.011 0.047 819x107! 100 0.027 218x10% 

The table reports results for 12 SNPs in the CONVERGE and replication samples. The first five columns give the chromosome (Chr.), genomic position (Pos.), SNP identifier (RSID), reference allele (Ref.) on Human 
Genome Reference GRCh37.p5 and alternative allele (Alt.) called in CONVERGE. The next five columns show the alternative allele frequency (Freq.) and results of association testing with MDD using imputed allele 
dosages in 10,640 CONVERGE samples (5,303 cases, 5,337 controls); information scores (Info.), odds ratio (OR) of association with MDD with respect to the alternative allele and standard error (s.e.) in the odds 


ratio were obtained from a logistic regression model; P values of association (P) were obtained from a linear-mixed model with a GRM containing all samples. The next three columns present the results of 
association with MDD in the replication cohort of 6,417 samples (3,231 cases, 3,186 controls) from a logistic regression model. The final three columns present the results of association with MDD in a joint analysis 
with both CONVERGE and replication cohorts from a logistic regression model. Bold type indicates the genome-wide significant markers. 


way to screen the genome for association signals. We were able to 
genotype more variants than on genotyping arrays and our set is larger 
than publicly available sources for imputation™*. Our imputation pipe- 
line employed standard tools, and it is likely that imputation accuracy 
could be improved with further algorithmic research. 

MDD is most probably highly polygenic’, and many additional loci 
remain to be discovered. We attribute the discovery and replication of 
two SNPs associated with MDD in the CONVERGE cohort to the 
recruitment of cases who were probably more homogeneous and more 
severely impaired than those collected in previous studies from 
Western cultures. In East Asia, reluctance to report MDD probably 
explains why hospital-ascertained cases are more severe, and why 
prevalence estimates for MDD are lower in China (3.6%'°) than in 
the US (16.2%)’”. Consistent with this interpretation, 85% of the cases 
of MDD in the CONVERGE cohort have melancholia, a severe sub- 
type of MDD; mapping melancholia led to a significant increase in 
the genetic signal at one locus. Finally, we note that one of the repli- 
cated risk loci is located close to a gene involved in mitochondrial 
biogenesis (SIRT1)'*, which, together with our finding that MDD is 
associated with increased amounts of mitochondrial DNA”, suggests 
an unexpected origin for at least some of the phenotypic manifesta- 
tions of MDD. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


No statistical methods were used to predetermine sample size. 

Sample collection. CONVERGE collected cases of recurrent major depression 
from 58 provincial mental health centres and psychiatric departments of general 
medical hospitals in 45 cities and 23 provinces of China. Controls were recruited 
from patients undergoing minor surgical procedures at general hospitals (37%) or 
from local community centres (63%). A sample size of 6,000 cases and 6,000 
controls was chosen on the basis of evidence available when the study was designed 
(in 2007) of the likely existence of genetic loci with odds ratio of 1.2 and above. All 
subjects were Han Chinese women with four Han Chinese grandparents. Cases 
were excluded if they had a pre-existing history of bipolar disorder, psychosis or 
mental retardation. Cases were aged between 30 and 60 and had two or more 
episodes of MDD meeting DSM-IV criteria’! with the first episode occurring 
between 14 and 50 years of age, and had not abused drugs or alcohol before their 
first depressive episode. All subjects were interviewed using a computerized assess- 
ment system. Interviewers were postgraduate medical students, junior psychia- 
trists or senior nurses, trained by the CONVERGE team for a minimum of 1 week. 
The diagnosis of MDD was established with the Composite International 
Diagnostic Interview (CIDI) (WHO lifetime version 2.1; Chinese version), which 
used DSM-IV criteria. The interview was originally translated into Mandarin by a 
team of psychiatrists at Shanghai Mental Health Centre, with the translation 
reviewed and modified by members of the CONVERGE team. 

The replication sample was obtained from five hospitals in the north of China. 
Patients were diagnosed as having MDD by at least two consultant psychiatrists by 
DSM-IV criteria. Samples were of both sexes, and all four grandparents were Han 
Chinese. Cases were aged between 30 and 60, and had two or more episodes of 
MDD meeting DSM-IV criteria. Exclusion criteria were pregnancy, severe medical 
conditions, abnormal laboratory baseline values, unstable psychiatric features (for 
example, suicidal), a history of alcoholism or drug abuse, epilepsy, brain trauma 
with loss of consciousness, neurological illness, or a concomitant axis I psychiatric 
disorder. Control subjects were recruited from local communities and provided 
information about medical and family histories. Exclusion criteria were a history 
of major psychiatric or neurological disorders, psychiatric treatment or drug 
abuse, or a family history of severe forms of psychiatric disorders. 

The study protocol was approved centrally by the Ethical Review Board of 
Oxford University (Oxford Tropical Research Ethics Committee) and the ethics 
committees of all participating hospitals in China. All interviewers were mental 
health professionals who are well able to judge decisional capacity. The study 
posed minimal risk (an interview and saliva sample). All participants provided 
their written informed consent. 

DNA sequencing. DNA was extracted from saliva samples using the Oragene 
protocol. A barcoded library was constructed for each sample. All saliva samples 
were randomized in allocation to sequencing batches, and experimenters perform- 
ing the sequencing procedure were blinded to sample allocation and outcome 
assessment. Sequencing reads obtained from Illumina Hiseq machines were 
aligned to Genome Reference Consortium Human Build 37 patch release 5 
(GRCh37.p5) with Stampy (v1.0.17)” using default parameters after filtering 
out reads containing adaptor sequences or consisting of more than 50% poor 
quality (base quality =5) bases. Samtools (v0.1.18)”’ was used to index the align- 
ments in BAM format”’, and Picardtools (v1.62) was used to mark PCR duplicates 
for downstream filtering. The Genome Analysis Toolkit’s (GATK, version 2.6)** 
BaseRecalibrator was then run on the BAM files to create base quality score 
recalibration tables, masking known SNPs and INDELs from dbSNP (version 
137, excluding all sites added after version 129). Base quality recalibration 
(BQSR) was then performed on the BAM files using GATKlite (v2.2.15)** while 
also removing read pairs that did not have the ‘properly aligned segment bit set by 
Stampy (1-5% of reads per sample). 

Variant calling. Variant discovery and genotyping at all polymorphic SNPs in the 
1000G Phasel East Asian (ASN) reference panel'* was performed simultaneously 
using post-BQSR sequencing reads from all samples using the GATK’s 
UnifiedGenotyper (version 2.7-2-g6bda569). We set the option ‘--genotype_ 
likelihood_model’ to ‘BOTH’, used default annotation outputs for variant 
calls, and set the ‘--dbSNP’ option in order to use dbSNP v137 rsids to fill in the 
variant ID column of the output variant call format (VCF) files. Variant quality 
score recalibration was then performed on these sites using the GATK’s 
VariantRecalibrator (version 2.7-2-g6bda569) and the biallelic SNPs from 
1000G Phasel ASN samples as a true positive set of variants. A sensitivity thresh- 
old of 90% to SNPs in the 1000G Phasel ASN panel was applied for SNP selection 
for imputation after optimizing for Transition to Transversion (TiTv) ratios in 
SNPs called. This gave a total of 21,356,798 (9,053,391 known in 1000 Genomes 
Phase 1 ASN Panel and 11,486,024 novel) biallelic SNPs identified from all chro- 
mosomes and unassembled contigs. We put forth 20,539,441 SNPs from the 


autosomes and chromosome X for imputation of genotype probabilities and 
downstream analyses. 

Genotype likelihood calculation and imputation. Genotype likelihoods (GLs) 
were calculated at all 20,539,441 SNPs using a sample-specific binomial mixture 
model implemented in SNPtools (version 1.0)’°, and imputation was performed 
without a reference panel using BEAGLE (version 3.3.2)”°. We used BEAGLE to 
perform imputation, using ten iterations on chunks of 3,000 SNPs with 600 SNPs of 
overlap. A second round of imputation was performed with BEAGLE on the same 
GLs, but only at biallelic SNPs polymorphic in the 1000G Phase 1 ASN panel using 
572 haplotypes from the 1000 Genomes Phase 1 ASN samples as a reference panel 
for six iterations on chunks containing roughly 3,000 SNPs with 600 SNPs of 
overlap. After both rounds of imputation we removed the outer 300 SNPs of every 
window and ligated imputation results of adjacent chunks. A final set of allele 
dosages and genotype probabilities was generated from these two sets of imputed 
results by replacing the results in the former with those in the latter at all sites 
imputed in the latter. We then applied a conservative set of inclusion thresholds for 
SNPs for GWAS: (a) P value for violation of the Hardy-Weinberg equilibrium 
>10 °; (b) information score >0.9; (c) MAF in CONVERGE >0.5%, to arrive at 
the final set of 6,242,619 SNPs for GWAS. 

Sample selection for GWAS. Using both processed sequencing data and imputed 
dosages at SNPs that passed quality control, we assessed the sequencing and 
imputation quality of all 11,670 samples whose genomic variants we imputed. 
We first looked into both the nuclear genome and mitochondrial genome for an 
excess of variants called, since this would indicate cross-sample contamination due 
to technical issues during sequencing. We quantified the number of singleton 
variants called in genic regions of the nuclear genome and found a mean of 
71.55 private variants per sample that were supported by more than 2 sequencing 
reads passing sequencing quality controls. We excluded 117 samples with a num- 
ber of singletons greater than the 99" percentile. Coverage of the mitochondrial 
genome was, on average, 102, allowing us to obtain high-quality sequences for 
this part of the genome. We found a mean of 15.70 heteroplasmic sites per sample, 
and 116 samples were found to have greater than the 99th percentile of the number 
of heteroplasmic sites. Of these 116 samples, 26 were already discarded for having 
excess nuclear genome singletons; and we excluded the remaining 90. 

We then checked imputation quality based on the certainty of genotypes 
imputed (maximum genotype probability >0.9). We identified 29 individuals 
who had fewer than 90% of their sites with maximum genotype probabilities 
>0.9. We excluded these samples from further analysis. 

Finally, we assessed the 11,434 remaining samples for genetic relatedness. 
Although being unrelated to other individuals recruited for the CONVERGE study 
was a clear criteria in our data collection process, there were instances when the 
same patient or a relative of the patient visited multiple hospitals and was thus 
recruited more than once. To exclude duplicates and first-degree relatives from 
our sample for GWAS, we estimated pairwise genome-wide identity by descent 
(IBD) using identity by state (IBS) information in hard-called genotypes 
from imputed genotype probabilities at 399,211 common tagging SNPs across 
all autosomes (MAF > 1%, linkage disequilibrium (LD) <0.5, all known in 
1000 Genomes Phase 1). We implemented this in PLINK (v1.07)” with the option 
‘--genome’. We excluded a total of 392 samples (duplicates and first-degree rela- 
tives) from our final set of samples for GWAS. We retained second-degree relatives 
and beyond, correcting for the relatedness between them using a linear mixed 
model. We also excluded 402 samples with incomplete phenotype information, 
giving a final set of 10,640 samples (5,303 cases of MDD, 5,337 controls) for the 
primary GWAS of MDD. 

GWAS using linear mixed model and liability score estimates. We implemen- 
ted MLMA using factored spectrally transformed linear mixed models (FastLMM, 
v2.06.20130802) ®?° and computed one GRM per chromosome using the mixed 
linear model with candidate marker excluded (MLMe) approach, removing the 
SNPs from the chromosome in question from a base set of 322,911 common 
tagging SNPs from all autosomes (MAF > 1%, LD <0.5, all known in 1000G 
Phasel ASN panel) to prevent loss of power through ‘double fitting’ of the can- 
didate SNP (and those in LD with it) in the GRM as a random effect, while testing 
each SNP as a fixed effect. Manhattan plots and quantile—quantile plots of the log, 
of P values of the GWAS were generated with custom code in R (ref. 28). Genomic 
control inflation factor A was calculated using custom code in R (ref. 28). 
Replication and joint analyses. We genotyped the replication sample on a 
MassARRAY system mass spectrometer. TYPER4.0 was used to assess the reli- 
ability of genotype calls generated by SpectroREAD from the mass spectra. Default 
genotype call inclusion criteria were used. To perform the association analysis with 
MDD case-control status at these 12 sites in the replication sample, we obtained 
effect sizes for discovery from logistic regression with principal component (PC) 
correction, and then for replication from logistic regression, and then performed 
fixed-effects meta-analysis. 
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Polygenic risk profiling and binomial sign-test. Single SNP association results 
were obtained from the PGC study of MDD’. Prior to analysis, SNPs were lifted 
over to GRCh37/hg19 coordinates and excluded if: (a) monomorphic in either 
European (n = 379) or East Asian (n = 286) populations from the 1000 
Genomes Project Phase 1 reference data'*; or (b) absent from the filtered 
CONVERGE data set. To construct the PGC-trained polygenic score, we ini- 
tially selected autosomal SNPs with statistical imputation information 
(information score) greater than 0.9 and MAF greater than 1% in both studies, 
and performed subsequent LD-based ‘clumping’ to remove markers from 
highly correlated SNP pairs (pairwise r? > 0.2 in East Asians, 500 kb window) 
while preferentially retaining SNPs with smaller PGC P values. Using the res- 
ultant SNP set, we constructed polygene scores based on varying P value thresh- 
olds (1 X 107°, 1 X 107°, 1 X 10-4, 1 X 107°, 0.01, 0.1, 0.2, 0.3, 0.4, 0.5, and 1) 
as previously described”. We assessed the predictive value of polygenic scores in 
a genetically unrelated subset of the CONVERGE sample (with pairwise relat- 
edness less than 0.1) by logistic regression, with adjustment for ancestry prin- 
cipal components, demonstrating significant association with MDD status. The 
estimated variance in MDD risk accounted for by the polygenic score is given by 
Nagelkerke’s R’. Using the same P value thresholds, we tabulated the number of 
independent SNPs with the same direction of allelic effect in the PGC results as 
observed in CONVERGE. The filtering criteria for SNPs was an information 
score greater than 0.9 in CONVERGE and MAF greater than 1% in both studies; 
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and an analogous LD-clumping procedure was performed (pairwise r° > 0.2 in 
Europeans, 500 kb window). A one-sided binomial sign test was used to assess 
whether this observed fraction was significantly greater than that expected by 
chance. Results are given in Extended Data Table 4. 
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Extended Data Figure 1 | Quantile-quantile plots for major depressive method implemented in FastLMM on 10,640 samples (5,303 cases, 5,337 
disorder. Quantile-quantile plot of GWAS for MDD using the mixed linear _ controls). Genomic inflation factor 2 = 1.070, rescaled for an equivalent study 
model with exclusion of the chromosome that the marker is on (MLMe) of 1,000 cases and 1,000 controls (A;999) = 1.013. 
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rs2922240 chr1:11493832:T/C 
study freq _ info or se pval 
pri 0.385 1.018 1.141 0.028 2.80E-06 
sqnm 1.021 1.141 0.029 5.82E-06 
repli 0.386 0.949 0.037 0.15 
joint 0.386 1.070 0.022 2.47E-03 
pgc 0.495 0.999 0.947 0.021 0.011 
rs57047840 chr1:228052027:A/G 
*rs10916214(R2:0.479) 

study freq _ info or se pval 
pri 0.284 0.970 1.138 0.031 4.64E-05 
sqnm 0.970 1.141 0.032 4.13E-05 
repli 0.302 1.001 0.041 0.99 
joint 0.293 1.088 0.025 7.02E-04 


pgc* 0.133 0.992 0.969 0.030 0.300 


rs55800092 chr6:4386107:C/T 
*rs17138114(R2:0.891) 

study freq info or se pval 
pri 0.151 1.001 0.824 0.039 1.35E-06 
sqnm 1.001 0.819 0.040 6.35E-07 
repli 0.962 0.052 0.45 


joint 0.876 0.031 1.77E-05 


0.138 
0.144 
0.073 0.975 0.972 


0.037 0.440 


pgc* 


rs35936514 chr10:126244970:C/T 
*rs35841851(R2:0.981) 

study freq _ info or se 
pri 0.260 0.993 0.839 0.032 
sqnm 0.992 0.845 0.033 
repli 0.838 0.041 
joint 0.842 0.025 


pval 
1.27E-08 
2.91E-07 
1.68E-05 
6.45E-12 


0.261 
0.260 


0.023 0.920 0.970 0.051 0.553 


pgc* 


1817827252 chr14:66833851:C/G 
*rs2319184(R2:0.965) 

study freq info or se pval 
pri 0.463 1.011 0.887 0.028 1.44E-05 
sqnm 1.008 0.896 0.029 1.12E-04 
repli 0.458 0.962 0.041 0.34 
joint 0.460 0.907 0.023 1.93E-05 
pgce* 0.028 0.941 1.114 0.068 0.115 
rs1921918 chrX:24656658:G/A 
study freq info or se pval 
pri 0.721 0.995 0.883 0.031 3.22E-05 
sqnm 0.998 0.877 0.032 3.59E-05 
repli 0.733 0.994 0.047 0.9 
joint 0.726 0.917 0.026 6.89E-04 
pgc 0.581 1.238 0.960 0.019 0.035 
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rs3766688 chr1:175151950:T/C 


study freq info or se 
pri 0.394 1.003 0.875 


pval 


sqnm 1.000 0.870 

repli 0.388 0.991 0.037 0.81 
joint 0.392 0.918 0.022 1.35E-04 
pgc 0.546 0.926 0.970 0.022 0.163 


rs55713588 chr5:9161674:A/G 
*rs13360003(R2:0.177) 

study freq info or se pval 
pri 0.096 0.893 1.278 0.050 6.04E-07 
sqnm 0.893 1.302 0.052 3.34E-07 
repli 1.054 0.062 0.39 
joint 1.042 0.035 2.38E-01 


0.100 
0.095 
0.109 


0.018 0.774 0.920 


pgc* 


rs12415800 chr10:69624180:G/A 
*rs16924945(R2:0.323) 

study freq _ info or se 
pri 0.452 0.992 1.164 0.028 
sqnm 0.990 1.167 0.029 
repli 1.130 0.036 


joint 1.150 0.022 


pval 
1.92E-08 
8.44E-08 
7.71E-04 
2.53E-10 


0.443 
0.446 


0.005 0.355 1.629 0.716 0.496 


pge* 


rs61967003 chr13:107659212:C/T 
*rs16969540(R2:0.980) 

study freq _ info or se pval 
pri 0.017 0.999 1.645 0.109 6.70E-06 
sqnm 0.995 1.765 0.116 9.64E-07 
repli 0.015 0.788 0.150 0.11 
joint 0.016 1.277 0.087 4.87E-03 
pge* 0.055 0.982 0.941 0.049 0.211 
rs11880240 chr19:34493757:C/G 
*rs7254953(R2:0.645) 

study freq _ info or se pval 
pri 0.068 1.019 1.291 0.055 8.02E-06 
sqnm 1.017 1.281 0.056 1.08E-05 
repli 0.070 1.048 0.072 0.51 
joint 0.069 1.184 0.043 9.52E-05 
pgc* 0.092 0.785 1.054 0.046 0.253 
rs11573525 chrX:25011374:C/T 
study freq info or se pval 
pri 0.260 0.971 1.160 0.032 5.86E-06 
sqnm 0.975 1.158 0.033 9.60E-06 
repli 0.262 1.011 0.047 0.82 
joint 0.264 1.100 0.027 3.43E-04 
pgc 0.158 1.081 1.031 0.028 0.275 


©2015 Macmillan Publishers Limited. All rights reserved 


LETTER 


0.028 1.83E-06 ———#—— 
0.029 1.93E-06 ———#—— 


——— 
— 


—— 


0.85 0.9 0.95 | 1.05 


OR (95% Cl) 


+ 


0.442 ———_s—_—_ 


rt. a. va 
08 09 1 11°12 413 1.4 


OR (95% Cl) 


06 08 1 


OR (95% Cl) 


12°14 16 18 2 


06 08 1 12 14 16 18 


OR (95% Cl) 


a a er | 
1 6 Bs] 1.2 1.3 


OR (95% Cl) 


0.95 1 105 11 116 12 


OR (95% Cl) 


LETTER 


Extended Data Figure 2 | Forest plots of estimated SNP effects in 
CONVERGE and PGC studies. This figure presents the association odds 
ratios (OR) at 12 SNPs in CONVERGE and the best available proxy SNPs in 
PGC-MDD (pairwise r > 0.6, 500 kb window; the proxy SNP is marked by an 
asterisk). We present the alternative allele frequency (freq), odds ratio (or) with 
respect to the alternative allele, standard error of odds ratio (se) and P values of 
association (pval) for the following analyses (study): primary association 
analysis with a linear-mixed model using imputed allele dosages in 10,640 


samples in CONVERGE (pri); validation analysis with logistic regression 
model with principal components (PCs) as covariates using genotypes from 
Sequenom on 9,921 samples in CONVERGE (sqnm); association with MDD 
with a logistic regression model in a replication cohort of 6,417 samples using 
genotypes from Sequenom (repli); joint association analysis with MDD with a 
logistic regression model using imputed allele dosages in CONVERGE and 
genotypes from Sequenom in a replication cohort (17,057 samples in total; 
joint). 
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Extended Data Figure 3 | Manhattan and quantile quantile plots for Aioo0 = 1.014. ¢, Regional association plot of GWAS hits on chromosome 10, 
melancholia. a, Manhattan plot of GWAS for melancholia using the MLMe _ focusing on top SNP rs80309727 at 5’ of SIRT1 gene, generated with 
method implemented in FastLMM on 9,846 samples (4,509 cases, 5,337 LocusZoom. 


controls). b, Quantile-quantile plot of GWAS for melancholia; 2 = 1.069, 
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Extended Data Figure 4 | Empirical estimation of the odds ratio increases samples, equal to the number of cases of MDD not diagnosed as being 
due to the removal of cases not falling under the diagnostic class of melancholic. The horizontal axis is the odds ratio for each analysis, and the 


melancholia from an association analysis with major depression. The figures _ vertical axis the frequency of occurrence of the odds ratio in 10,000 analyses. 
show the empirical distributions of the odds ratios for association with each of | The vertical red line is the observed odds ratio after removing cases of MDD not 
two SNPs (rs79804696, rs35936514), after removing a random set of 796 diagnosed as melancholic. 
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Extended Data Table 1 | Comparison between association results using imputed dosages and directly genotyped markers 


CHR POS 

1 11493832 
1 175151950 
1 228052027 
5 9161674 

6 4386107 


10 69624180 
10 126244970 
13 107659212 
14 66833851 
19 34493757 
24656658 
x 25011374 


SNP 
RSID 

1S2922240 
rs3766688 
rs57047840 
rs55713588 
rs55800092 
rs12415800 
rs35936514 
rs61967003 
1817827252 
rs11880240 
rs1921918 


rs11573525 


REF 


Cc 
Cc 
G 
G 
T 
A 
T 
- 
G 
G 
A 
. 


F 


o of a O08 aA Oo Oo 8 fe + 4 A 


imputed Dosages (N=9,921) 
OR SE P 
1.141 0.029 5.82E-06 
0.870 0.029 1.93E-06 
1.141 0.032 4.13E-05 
1.302 0.052 3,34E-07 
0.819 0.040 6.35E-07 
1.167 0.029 8.44E-08 
0.845 0.033 2.91E-07 
1.765 0.116 9.64E-07 
0.896 0.029 1.12E-04 
1.281 0.056 1.08E-05 
0.877 0.032 3.59E-05 
1.158 0.033 9.60E-06 


N 
9,864 
9,901 
9,724 
9,636 
9,881 
9,689 
9,915 
9,914 
8,562 
9,912 
9,899 
9,912 


Sequenom genotypes 

r OR SE P 
0.991 1.141 0.029 5,.72E-06 
0.995 0.871 0.029 2.32E-06 
0.974 1.141 0.032 3.91E-05 
0.925 1.263 0.050 2.87E-06 
0.992 0.817 0.040 5.52E-07 
0.993 1.167 0.029 9.30E-08 
0.993 0.842 0.033 1.40E-07 
0.997 1.748 0.116 1.53E-06 
0.999 0.897 0.031  3.94E-04 
0.996 1.281 0.056 1.14E-05 
0.994 0.880 0.032 6.39E-05 
0.969 1.144 0.032 3.21E-05 


The table reports results for association between MDD and 12 SNPs. The first five columns give the chromosome (CHR), genomic position (POS), SNP identifier (RSID), reference allele (REF) on Human Genome 
Reference GRCh37.p5, and alternative allele (ALT) called in CONVERGE. The next three columns show results for imputed allele dosages at 12 SNPs (odds ratio (OR) of association with MDD with respect to the 
alternative allele and standard error (SE); P values of association (P)). The next two columns present the number of samples (N) successfully genotyped using the Sequenom platform (a high-sensitivity and 
-specificity assay), and the Pearson correlation (7) between the imputed allele dosages and the genotypes from Sequenom. The final three columns present results from analyses of association with MDD using 
genotypes from the Sequenom genotyping platform. Bold type indicates the genome-wide significant markers; Extended Data Table 2 gives further information on the results for these markers. 
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Extended Data Table 2 | Genotype distribution and P values for violation of the Hardy-Weinberg equilibrium in CONVERGE and replication 


cohorts 
Mpp te CONVERGE Replication Cohort 
Disease State HomRef/Het/HomAIt HWE P-value HomRef/Het/HomAIt HWE P-value 

rs12415800 2151/5301/3169 0.445 1212/3037/1920 0.857 
All 

rs35936514 705/4054/5794 0.919 422/2400/3398 0.974 

rs12415800 1178/2626/1490 0.744 654/1538/918 0.829 
Cases 

rs35936514 318/191 9/3027 0.549 190/1136/1783 0.627 

rs12415800 973/2675/1679 0.106 558/1499/1002 0.971 
Controls 

1835936514 387/2135/2767 0.389 232/1264/1615 0.503 


This table shows the number of samples with the homozygous reference genotype (HomRef), heterozygous genotypes (Het), and homozygous alternative genotype (HomAIt), as well as P values for violation of the 
Hardy-Weinberg equilibrium (HWE) for both CONVERGE study samples and the replication cohort from northern China at the top SNPs rs12415800 in the SIRT1 locus and rs35936514 in the LHPP locus from the 
GWAS on MDD. The top two rows show these measures for all samples in both the CONVERGE and replication study, the next two rows show these measures for just cases in CONVERGE and the replication cohort, 
and the last two rows show these measures for just the controls. The genotype distributions for CONVERGE are obtained from hard-called genotypes from maximum imputed genotype probabilities for each 
sample at each of the two sites. As a genotype will not be called if the maximum genotype probability at a site is lower than 0.9 for any single sample, the total number of CONVERGE samples showing called 
HomRef/Het/HomAlt genotypes does not equal 10,640 for either SNP. For rs12415800, 19 samples (9 cases, 10 controls) have no genotype calls owing toa maximum genotype probability smaller than 0.9, giving 
a total of 10,621 CONVERGE (5,294 cases, 5,327 controls) samples with genotype calls. For rs35936514, 87 (39 cases, 48 controls) samples have no genotype calls owing to a maximum genotype probability 
smaller than 0.9, giving a total of 10,553 (5,264 cases, 5,289 controls) CONVERGE samples with genotype calls. 
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Extended Data Table 3 | Single-marker association results of top CONVERGE hits in the PGC study of MDD 
CONVERGE (10,640 samples) PGC MDD (18,759 samples) 

“CHR POS RSID REF ALT FREQ OR SE  P RSID. LDr2 FREQ INFO OR P 
1 11493832 1s2922240 0.3846 1.141 0.028 2.80E-06 1s2922240 1.00 0.495 0.999 0.947 0.011 
1 175151950 rs3766688 0.394 0.875 0.028 1.83E-06 rs3766688 1.00 0.546 0.926 0.970 0.163 
1 228052027 ~=—rs57047840 0.2843 1.138 0.031 4.64E-05 1s10916214 0.48 0.133 0.992 0.969 0.300 
5 9161674 1s55713588 0.0956 1.278 0.050 6.04E-07 1rs13360003 0.18 0.018 0.774 0920 0.442 
6 4386107 rs55800092 0.1512 0.824 0.039 1.35E-06 1s17138114 0.89 0.073 0.975 0.972 0.440 
0.4519 1.164 0.028 1.92E-08 rs16924945 0.32 0.005 0.355 1.629 0.496 
10 126244970 rs35936514 0.2609 0.839 0.032 1.27E-08 rs35841851 0.98 0.023 0.92 0.970 0.553 
13 107659212 rs61967003 0.0172 1.645 0.109 6.70E-06 +s16969540 0.98 0.055 0.982 0.941 0.211 
14 66833851 1817827252 0.4624 0.887 0.028 1.44E-05 1s2319184 0.96 0.028 0.941 1.114 0.115 
19 34493757 rs11880240 0.0679 1.291 0.055 8.02E-06  rs7254953 0.65 0.092 0.785 1.054 0.253 


Xx 24656658 rs1921918 


Cc 
Cc 
G 
G 
T 

10 69624180 rs12415800 A 
si 
T 
G 
G 
A 0.7206 0.883 0.031 3.22E-05 1s1921918 1.00 0.581 1.238 0.960 0.035 
a 


oOo 2©2 0. oO Oo OO °o. QO > =. I 


Xx 25011374 rs11573525 0.2602 1.160 0.032 5.86E-06 1s11573525 1.00 0.158 1.081 1.031 0.275 


The table compares results from 12 SNPs genotyped in the CONVERGE cohort with either the same SNPs, or best available proxies within a 500 kb window, as reported by the Major Depressive Disorder Working 
Group of the PGC. The first five columns give the SNP identifier (RSID), chromosome (CHR), genomic position (POS), reference allele (REF) on Human Genome Reference GRCh37.p5, and alternative allele (ALT) 
called in CONVERGE. The next four columns show the alternative allele frequency (FREQ) and results of association testing with MDD at the 12 SNPs in CONVERGE: odds ratio (OR) of association with MDD with 
respect to the alternative allele and standard error (SE) in the odds ratio were obtained from a logistic regression model with PCs as covariates; P values of association (P) were obtained from a linear mixed model 
with a genetic relatedness matrix containing all samples. The next three columns show the SNP identifier (RSID) of best available proxy of each SNP reported in PGC-MDD, the linkage disequilibrium correlation 
(LD a ) expressed as the value between the SNP in PGC-MDD and SNP in CONVERGE, and the alternative allele frequency (FREQ) at the SNP in PGC-MDD. The last three columns show the information scores 
(INFO), odds ratios (OR) and P values of association with MDD in PGC-MDD from a logistic regression model. Bold type indicates the genome-wide significant markers. 
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Extended Data Table 4 | Polygenic risk profiling and binomial sign tests 


Polygenic risk profiling Binomial sign test 

Be Pr P No. SNPs (%) P 
0.000001 0.000715 0.0174 3 (100) 0.125 
0.00001 8.40E-05 0.415 12 (66.7) 0.194 
0.0001 2.57E-05 0.652 62 (58.1) 0.126 
0.001 5.87E-06 0.829 481 (53.6) 0.0605 
0.01 8.67E-05 0.407 3632 (51.1) 0.101 
0.1 0.00142 0.000797 26106 (50.4) 0.126 
0.2 0.00126 0.00156 45166 (50.6) 0.00331 
0.3 0.00116 0.00246 61074 (50.5) 0.00627 
0.4 0.00125 0.00168 74676 (50.5) 0.00335 
0.5 0.0011 0.00317 86429 (50.4) 0.00758 
1 0.000924 0.00684 124361 (50.3) 0.0116 


The table shows the predictive value of a PGC-trained polygenic risk score on the CONVERGE results. Predictive values are shown at varying P value thresholds (pT) from P< 1 x 10° ° to 1 (thatis, all results). Pis the 
P value of the prediction and r? is the amount of variance explained (thus the table shows that including all independent SNPs from the PGC study of MDD, irrespective of individual P value, explained 0.09% of 
MDD risk in CONVERGE.). The number of independent SNPs at each threshold is presented (No. SNPs); the significance of the observed fraction (%) demonstrating a consistent direction of effect was assessed bya 
one-sided binomial sign test. 
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Impermanence of dendritic spines in live adult CA1 


hippocampus 


Alessio Attardo’?+*, James E. Fitzgerald'}* & Mark J. Schnitzer? 


The mammalian hippocampus is crucial for episodic memory 
formation’ and transiently retains information for about 3-4 weeks 
in adult mice and longer in humans’. Although neuroscientists 
widely believe that neural synapses are elemental sites of informa- 
tion storage’, there has been no direct evidence that hippocampal 
synapses persist for time intervals commensurate with the duration 
of hippocampal-dependent memory. Here we tested the prediction 
that the lifetimes of hippocampal synapses match the longevity of 
hippocampal memory. By using time-lapse two-photon microen- 
doscopy* in the CA1 hippocampal area of live mice, we monitored 
the turnover dynamics of the pyramidal neurons’ basal dendritic 
spines, postsynaptic structures whose turnover dynamics are 
thought to reflect those of excitatory synaptic connections”®. 
Strikingly, CA1 spine turnover dynamics differed sharply from 
those seen previously in the neocortex’ °. Mathematical modelling 
revealed that the data best matched kinetic models with a single 
population of spines with a mean lifetime of approximately 1-2 
weeks. This implies ~100% turnover in ~2-3 times this interval, a 
near full erasure of the synaptic connectivity pattern. Although 
N-methyl-D-aspartate (NMDA) receptor blockade stabilizes spines 
in the neocortex’®”, in CA1 it transiently increased the rate of spine 
loss and thus lowered spine density. These results reveal that adult 
neocortical and hippocampal pyramidal neurons have divergent 
patterns of spine regulation and quantitatively support the idea 
that the transience of hippocampal-dependent memory directly 
reflects the turnover dynamics of hippocampal synapses. 

The hypothesis that synaptic connectivity patterns encode informa- 
tion has profoundly shaped research on long-term memory. In the 
hippocampus, synapses in basal CA1 mainly receive inputs from hip- 
pocampal area CA3, and the CA3 — CA] projection has been widely 
studied regarding its plasticity and key role in memory. As in the 
neocortex, dendritic spines in the hippocampus are good proxies for 
excitatory synapses’*, motivating time-lapse imaging of spines as a 
means of monitoring synaptic turnover”"®. 

Previous work has illustrated in vivo imaging of CA1 spines in acute 
and recently also in chronic preparations’ *. We tracked spines for up 
to ~14 weeks by combining microendoscopes of diffraction-limited 
resolution’ (0.85 NA), a chronic mouse preparation for time-lapse 
imaging in deep brain areas*, and Thy1-GFP mice that express green 
fluorescent protein (GFP) in a sparse subset of CA1 pyramidal neurons 
(Fig. 1 and Extended Data Fig. 1). Histological analyses confirmed that 
this approach induced minimal activation of glia (Extended Data Fig. 2), 
as shown previously*"®. 

A major concern was that it is not possible to distinguish two or 
more spines spaced within the resolution limit of two-photon micro- 
scopy. This issue is critical for studies of hippocampal spines, which are 
more densely packed than neocortical spines'’. To gauge how com- 
monly the appearances of adjacent spines merged together in optical 
images, we examined tissue slices from Thy1-GFP mice, using both 


Pie 


two-photon microendoscopy and stimulated-emission depletion 
(STED) microscopy. STED microscopy offered super-resolution 
(~70 nm full width at half maximum (FWHM) lateral resolution), 
an optical resolution nearly nine times finer than that of two-photon 
microendoscopy’* (~610 nm), permitting tests comparing pairs of 
images of the same CA1 dendrites (Fig. 2a and Extended Data Fig. 3). 

As expected, we saw nearby spines in STED images that appeared 
merged in the two-photon images (Fig. 2a). Of 151 spines that 
appeared unitary in the two-photon images, 23 + 3.6% (standard error 
of the mean (s.e.m.)) were actually two spines and 6.0 + 1.6% were 
actually three spines (n = 12 dendrites) (Fig. 2b). Distances between 
merged spines in the two-photon images (0.51 + 0.14 um; mean + 
standard deviation (s.d.)) were below the (0.61 sm) resolution limit! 
(Fig. 2c). Clearly, merging can induce illusory spine stability, since two 
or more real spines must vanish for a merged spine to disappear. 

To treat merging effects quantitatively, we developed a mathemat- 
ical framework that permits systematic examination of turnover 
dynamics across different kinetic models and investigation of how 
the merging of spines on imaging alters the manifestations of these 
dynamics in two-photon imaging data (Supplementary Information 
and Extended Data Figs 4-7). We used computer simulations to study 
how the density and apparent kinetics of merged spines vary with 
geometric variations of individual spines, spine density, resolution 
and spine kinetics (Supplementary Information and Extended Data 
Fig. 8). We also checked experimentally whether fluctuations in spine 
angle and length, and the radius of the dendrite near the spine, might 
impact measures of spine turnover (Extended Data Figs 6 and 9). By 
simulating time-lapse image series, we scored and analysed synthetic 
data across a broad range of optical conditions, spine densities, geo- 
metries and turnover kinetics (Fig. 2d and Extended Data Figs 4 and 5). 

The simulations and mathematical modelling confirmed that naive 
analyses of two-photon data are inappropriate at the spine densities in 
CAI, owing to merging and the resulting illusion of increased stability 
(Supplementary Information and Extended Data Figs 4 and 7). For 
stability analyses, we followed previous studies’ in our use of the 
survival fraction curve, S(t), the fraction of spines appearing in the 
initial image acquired at t = 0 that also appeared in the image acquired 
at time t. The shape and asymptotic value of S(f) provide powerful 
constraints on kinetic models of turnover and the fraction of spines 
that are permanent (that is, a rate constant of zero for spine loss)’ 
(Supplementary Information). Strikingly, visual scoring of simulated 
images (Fig. 2d and Extended Data Fig. 5) yielded underestimates of 
spine density (Fig. 2e) and patent overestimates of the lifetimes of 
spines (Fig. 2f). But crucially, our treatment accurately predicted the 
relationship between the actual density and the visually determined 
underestimate (Fig. 2e), and properly explained the apparent turnover 
dynamics, S(t), in terms of the actual kinetics (Fig. 2f and Extended 
Data Figs 5c, 7). Overall, the simulations showed that face-value inter- 
pretations of two-photon images from CAI are untrustworthy, but 
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Figure 1 | Dendritic spines are dynamic in CA1 hippocampus of the 

adult mouse. a, A sealed, glass guide tube implanted dorsal to CA1 allows 
time-lapse in vivo imaging of dendritic spines. b, A doublet microendoscope 
projects the laser scanning pattern onto the specimen plane in the tissue. 
Inset: red lines indicate optical ray trajectories. c, CA1 dendritic spines in a 
live Thy1-GFP mouse. d, Time-lapse image sequences. Arrowheads indicate 
spines that either persist across the sequence (white arrowheads), disappear 
midway (red), arise midway and then persist (green), arise midway and 
then later disappear (yellow), or disappear and then later appear at an 
indistinguishable location (cyan). Scale bars: 500 um (b, inset); 10 um (c); 
2 um (d). 


that it is possible to make quantitatively correct inferences about spine 
kinetics provided that one properly accounts for the optical resolution. 
Using the same framework, we next analysed real data. 

Initial analyses focused on four mice in which we acquired image 
stacks of CA1 pyramidal cells and tracked spines every 3 days for 21 
days (60 dendrites total; 50 + 7 (mean + s.d.) per session) (Fig. 3a). 
Whenever individual spines appeared at indistinguishable locations 
on two or more successive sessions, we identified these as observa- 
tions of the same spine. Overall, we made 4,903 spine observations 
(613 + 71 (mean + s.d.) per day). Spine densities were invariant over 
time (Fig. 3b) (Wilcoxon signed-rank test; n = 16-50 dendrites per 
comparison of a pair of days; significance threshold = 0.0018 after 
Dunn-Sidak correction for 28 comparisons; P > 0.047 for all compar- 
isons), as were spine volumes (P = 0.87; = 43 spines, Kruskal-Wallis 
analysis of variance (ANOVA)) (Extended Data Fig. 10) and the turn- 
over ratio, the fraction of spines arising or vanishing since the last 
session’ (Fig. 3b) (Wilcoxon signed-rank test; 14-40 dendrites; signifi- 
cance threshold = 0.0025 after correction for 20 comparisons; P > 0.31 
for all comparisons). Fewer than 50% of spines (46 + 2%; mean + s.e.m.; 
n = 4 mice) were seen throughout the experiment (Extended Data 
Fig. 1d, e), although our simulations had shown that this naive obser- 
vation overestimated spine stability. 

The time invariance of spine densities and turnover ratios implied 
that through our mathematical framework we could determine the 
underlying kinetic parameters governing turnover. S(t) curves for 
the total and newborn spine populations ostensibly resembled those 
reported for the neocortex’ (Fig. 3c). However, unlike in the neocor- 
tex, the 46% of spines seen in all sessions differed notably from the 
odds that a spine was observed twice in the same location across 
two distant time points, as quantified by the asymptotic value of S(t) 
(73 + 3%). This discrepancy suggested that, as our modelling had 
indicated, many CA1 spines might vanish and reappear in an ongoing 
way at indistinguishable locations. 
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Figure 2 | A simple kinetic model is sufficient to describe CA1 pyramidal 
cell spine dynamics. a, Two-photon microendoscopy and STED imaging of 
the same dendrites in vitro. Top, two-photon images depict spines closer than 
the resolution limit as merged entities. Bottom, asterisks mark example visually 
scored spines, showing cases in which nearby spines do (right) or do not (left) 
merge. b, Fraction of spines (n = 151 total) seen by two-photon imaging that 
were one, two or three spines as determined by STED imaging. Black bars show 
mean + s.d. for 12 dendrites. c, Separations between adjacent unmerged spines 
and pairs of spines that appeared merged by two-photon imaging. Open grey 
circles mark individual results from each of n = 150 spines. Black bars show 
mean + s.d. d, Example computer-simulated, time-lapse image sequence used 
to quantify how resolution limits impact measured spine densities and 
dynamics. e, Computational modelling predicts the underestimation of spine 
density due to the finite optical resolution. Blue diagonal line: perfect detection 
of all spines. Black horizontal dashed lines: typical ranges of spine densities on 
pyramidal cells in neocortex and hippocampus. Red data: results from visually 
scoring simulated images of dendrites of varying spine densities. Black curve: 
prediction from the scoring model using 600 nm as the minimum separation 
between two spines correctly distinguished. f, Modelling predicts the 
overestimation of spine stability due to merging of adjacent spines in 
resolution-limited images. Blue data: survival fraction values (mean + s.e.m.) 
for actual spine turnover in computer simulations (spine density: 2.56 m7’). 
Red data: apparent turnover for these same simulated dendrites, as scored from 
simulated two-photon images. Black curves: theoretical predictions for spine 
survival based on the scoring model. Scale bars: 1 [um (a); and 2 jum (d). 


We next tested whether a prolonged environmental enrichment 
would alter spine turnover. Previous data from rats have indicated 
that basal CA1 spine density can rise ~10% after environmental 
enrichment’’. Data from mice are limited to CA1 apical dendrites 
and have yielded contradictory results'*” (Supplementary Discussion). 
In three mice we imaged a total of 55 basal dendrites (39 + 14 (s.d.) per 
day) across 16 sessions, 3 days apart (Fig. 3d). After session 8 we moved 
the mice to an enriched environment, where they stayed throughout 
sessions 9-16. We made 8,727 spine observations in total (545 + 216 
(s.d.) per day). 

Comparisons of baseline and enriched conditions revealed no dif- 
ferences in spine density or turnover (Fig. 3e) (n = 10-53 dendrites; 
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Figure 3 | NMDA receptor blockade, but not environmental enrichment, 
altered spine turnover dynamics. a, Schedule of baseline imaging sessions. 
b, Neither measured spine densities (top) nor turnover ratios (bottom) varied 
for mice in their home cages. Horizontal lines: mean spine density and turnover 
ratio. c, Spine survival (top) and newborn spine survival (bottom). d, Schedule 
for study on environmental enrichment. e, f, No significant differences existed 
between baseline (black points) and enriched conditions (red) regarding spine 
density (e, top), turnover (e, bottom), survival (f, top), or newborn spine 
survival, (f, bottom). g, Schedule for the study on NMDA receptor blockade. 
h, MK801 caused a significant decline in spine density. Data are from mice 
imaged four times before (black) and six times during (green) MK801 
administration. Black and green horizontal lines respectively indicate mean 
densities during baseline and on the last 4 days of MK801 treatment. The 
density decrease was highly significant (Wilcoxon signed-rank test; n = 29 
dendrites; P = 0.0007). i, The decline in spine density early in MK801 treatment 
arose from a transient, highly significant difference between the rates at which 
spines were lost (darker bars) and gained (lighter bars) (Wilcoxon signed-rank 
test; n = 29 dendrites; P = 0.0008). Greyscale and green-shaded bars represent 
percentages of spines gained and lost for sessions before and during MK801 
dosage. ****P < 0,001. We normalized spine densities to their mean values in 
baseline conditions, which were 1.03 um? (b, top), 0.90 um! (e, top) and 
0.78 um”! (h). All error bars are s.e.m. for dendrites. 


P> 0.10, 8 paired comparisons of density; P > 0.039, 7 paired compar- 
isons of turnover ratio; Wilcoxon signed-rank tests with significance 
thresholds of 0.006 and 0.007, respectively, after corrections for 
multiple comparisons). Neither were there differences in spine survival 
(P > 0.057; 7 time points; n = 10-49 dendrites; Wilcoxon signed-rank 
test; significance threshold of 0.007 after Dunn-Sidak correction), nor 
in newborn spine survival (P > 0.29; 6 time points; n = 5-35 dendrites; 
significance threshold of 0.008; Fig. 3f). Thus, in mice, continuous 
enrichment does not substantially alter spine dynamics on CA1 basal 
dendrites. Nevertheless, mean volumes of stable spines underwent a 
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slight (7+ 3% (s.e.m.)) but significant decline upon enrichment 
(Wilcoxon signed-rank test; P = 0.007; 60 spines tracked for 16 ses- 
sions) (Extended Data Fig. 10d). Data on the structural effects of long- 
term potentiation (LTP) suggest an explanation of these findings; 
CAI spine densities rise transiently after LTP induction but return to 
baseline values 2 h later’', implying that continual enrichment would 
cause no net change in spine densities (Supplementary Discussion). 

We next examined whether blockade of NMDA glutamate recep- 
tors impacts spine turnover. These receptors are involved in multiple 
forms of neural plasticity, including in the CA1 area”’. In the neocor- 
tex, NMDA receptor blockade stabilizes spines by slowing their elim- 
ination while keeping their formation rate unchanged". We tracked 
CAI spines across 10 sessions at 3-day intervals in mice receiving the 
NMDA receptor blocker MK801 beginning after session 4 and onward 
(Fig. 3g). We examined 26 dendrites (25 + 1.4 (s.d.) per session), made 
5,020 spine observations (502 + 32 (s.d.) per day), and found that 
MK801 induced a significant decline (12 + 3% (s.e.m.)) in spine den- 
sity (Wilcoxon signed-rank test; 25 dendrites; P = 0.0007) (Fig. 3h). 
This stemmed from a transient disparity in the rates of spine loss 
versus gain (loss rate was 215 + 42% (s.e.m.) of the rate of gain; 
Wilcoxon signed-rank test; 25 dendrites; P = 0.0008) (Fig. 3i). 
These results indicate that the survival odds of CA1 spines depend 
on NMDA receptor function, illustrate our ability to detect changes in 
spine dynamics, and show that CA1 and neocortical spines have diver- 
gent responses to NMDA receptor blockade. 

To ascertain the underlying time constants governing spine turn- 
over, we compared the S(t) curves of visually scored spines to predic- 
tions from a wide range of candidate kinetic models (Fig. 4a). In each 
model there was a subset (0-100%) of permanent spines; the remaining 
spines were impermanent, with a characteristic lifetime, t. Since envir- 
onmental enrichment left the observed spine dynamics unchanged, we 
pooled the baseline and enriched data sets to extend the analyses to 
longer time-scales (Fig. 4b). By varying the actual spine density, frac- 
tion of stable spines, and characteristic lifetime for the unstable frac- 
tion, we identified the model that best fit the S(f) curves, using a 
maximum likelihood criterion (Supplementary Information). This 
model had 100% impermanent spines, with an actual density of 2.6 
jum * and t of ~10 days (Fig. 4a, b). This is twice the ~5-day lifetime 
reported for the transient subset of neocortical spines’°. There were 
also models with both permanent and impermanent spines that gave 
reasonable, albeit poorer fits to the CA1 data (Fig. 4a, b). Crucially, our 
analysis identified all models whose fits were significantly worse than 
the best model (white regions in Fig. 4a; P< 0.05, likelihood-ratio test); 
we regarded these as unsatisfactory in accounting for the CA1 data 
(Supplementary Information). 

Our results pointing to a single population of unstable spines in CA1 
contrast markedly with findings in adult neocortex, where >50% of 
spines seem permanent’ °. To make even-handed comparisons, we used 
our framework to re-analyse published data acquired in the mouse 
somatosensory neocortex that had supported this conclusion’. Owing 
to the lower density of neocortical spines, merging is far less of a concern 
(Fig. 4c), and our modelling confirmed that ~60% of neocortical spines 
are stable over very long timescales, supporting past conclusions’. 

Nevertheless, we found very significant differences between CA1 
and neocortical spine turnover dynamics (Fig. 4a; P = 0.01, likelihood 
ratio test). Models with only impermanent spines, which well described 
CAI, were insufficient (P< 10°, likelihood ratio test) for neocortex 
(Fig. 4a, d). The discrepant lifetimes of impermanent spines in the two 
areas (~10 days (CA1) versus ~5 days (neocortex)) posed further 
incompatibility (Fig. 4a, e). Conversely, models that explained neocor- 
tical spine turnover were incompatible with the CA1 data (P = 0.01, 
likelihood ratio test). Modelling alone cannot eliminate the possibility 
that CA1 basal dendrites have some permanent spines, but if any such 
spines exist they compose a far smaller fraction than in the neocortex. 
Hence, CA1 and the neocortex have distinct spine dynamics, percen- 
tages of impermanent spines, and turnover time constants. 
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Figure 4 | CA1 and neocortical spines exhibit distinct turnover kinetics. 

a, Multiple kinetic models are consistent with data on spine survival. Each 
model considered had two subpopulations: permanent and impermanent 
spines. Abscissa: mean lifetime for impermanent spines. Ordinate: fraction of 
spines that are permanent. Each datum is for an individual model; colour 
denotes the level of statistical significance at which the model could be rejected. 
Red points denote models that best fit the data. No results are shown for models 
incompatible with the data (P < 0.05). Models that best fit data from CA1 have 
~100% impermanent spines, with a ~ 10 day lifetime (green arrowhead). There 
are also models with permanent subpopulations that cannot be statistically 
rejected (for example, red arrowhead). Models that best fit patterns of 
neocortical spine turnover (black arrowhead), from mice age- and gender- 
matched’ to those used here, have ~50-60% permanent spines and a shorter 
lifetime (~5 days) for impermanent spines than in CA1. Models lacking 
permanent spines poorly fit the neocortical data; grey arrowhead marks the 
model for the grey curve in d. The four arrowheads indicate the models that 
generated the colour-corresponding curve fits in b, d. b, Empirically deter- 
mined survival curve for CA1 spines (black data: mean + s.e.m.; data set of 
Fig. 3e, f) over 46 days, compared to predictions (solid curves) from two of the 
models in a (green and red arrowheads in a). Green curve: best-fitting model, 
which has no permanent spines. Red curve: an example model with both 
stable and unstable spines. c, Owing to the higher density of spines in CA1 than 
in the neocortex (inset), optical merging is far more common in CA1. Given 
what appears to be one spine, vertical bars represent the probability as 
determined from the computational model that the observation is actually of 
1-5 spines. Probabilities were calculated using spine density values of the 
inset, 0.75 um as the minimum separation, L, needed to distinguish adjacent 
spines. Error bars: range of results for L within 0.5-1.0 jim. d, Empirically 
determined survival curves for neocortical spines (data set from ref. 7) over 28 
days, compared with predictions (solid curves) from two different models for 
spine turnover in a (grey and black arrowheads in a). Black curve: best fit 
attained with a stable subpopulation of spines. Grey curve: a model lacking 
permanent spines, poorly fitting the data. e, Spines in CA1 and neocortex differ 
substantially in proportions of permanent versus impermanent spines, spine 
lifetimes, effects of NMDA receptor blockade, and learning or novel experience. 


Further distinguishing CA1 and the neocortex are the contrary roles 
of NMDA receptor blockade (Figs 3h, i and 4e). In the neocortex, 
MK801 promotes stability by decreasing spine loss’? and blocking 
spine addition’’; conversely, NMDA receptor activation may speed 
turnover via addition of new spines and removal of pre-existing 
neocortical spines supporting older memories. In CA1, MK801 speeds 
turnover and promotes instability (Fig. 3h, i), suggesting that NUDA 
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receptor activation may transiently slow turnover and stabilize spines. 
Indeed, LTP induction in CA] is associated with stabilization of exist- 
ing spines and growth of new spines*??**. 

A natural interpretation is that spine dynamics may be specialized by 
brain area to suit the duration of information retention. The neocortex, 
a more permanent repository, might need long-lasting spines for 
permanent information storage and shorter-lasting ones ready to be 
stabilized if needed*’. The hippocampus, an apparently transient 
repository of information, might only require transient spines. The 
~1-2-week mean lifetime for the ~100% impermanent CA1 spines 
implies a near full erasure of synaptic connectivity patterns in ~3-6 
weeks, matching the durations that spatial and episodic memories are 
hippocampal-dependent in rodents'* (but see also ref. 25). The 
ensemble place codes of CA1 neurons also refresh in ~1 month’’, 
which could arise from turnover of the cells’ synaptic inputs. Since 
75-80% of CA3 — CA] inputs are monosynaptic”*, CA1 spine imper- 
manence probably implies a continuous re-patterning of CA3 — CA1 
connectivity throughout adulthood, which, owing to the sheer number 
of synapses and sparse connections is unlikely to assume the same 
configuration twice. 

Supporting these interpretations, artificial neural networks often 
show a correspondence between synapse lifetime and memory longev- 
ity’’, although in some models spine turnover and memory erasure can 
be dissociated**””. Computational studies also show that elimination of 
old synapses can enhance memory capacity’. More broadly, net- 
works that can alter synaptic lifetimes, not just connection strengths, 
can more stably store long-term memories while rapidly encoding 
new ones”’. 

The data described here are consistent with a single class of CA1 
spines, but future studies should examine both the finer kinetic fea- 
tures and cellular or network mechanisms of turnover’*. By using 
fluorescence tags to mark spines undergoing plastic changes, in vivo 
imaging might help relate connection strengths, spine lifetimes and 
memory performance. Finally, researchers should investigate spine 
turnover in animals as they learn to perform a hippocampal-depend- 
ent behaviour, to build on the results here by looking for direct rela- 
tionships between CA1 spine stability and learning. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


Animals and surgical preparation. Stanford University’s Administrative Panel 
on Laboratory Animal Care approved all procedures. We imaged neurons in mice 
expressing GFP driven by the Thy1 promoter’! (heterozygous males 10-12 weeks 
old, GFP-M lines on a C57BL/6 X F1 background). We did not perform any 
formal randomization in the assignment of mice to specific groups, but we 
informally selected mice in a random manner without use of any exclusion 
criteria. We performed surgeries as previously published* but with a few modifica- 
tions. We anaesthetized mice using isoflurane (1.5-3% in O) and implanted a 
stainless steel screw into the cranium above the brain’s right hemisphere. We 
performed a craniotomy in the left hemisphere (2.0 + 0.3 mm posterior to bregma, 
2.0 + 0.3 mm lateral to midline) using a 1.8-mm-diameter trephine and implanted 
the optical guide tube with its window just dorsal to, but not within, area CA1, 
preserving the alveus. 

Guide tubes and microlenses. Guide tubes were glass capillaries (1.5 mm (ID), 1.8 
mm (OD) or 2 mm (ID), 2.4 mm (OD); 2-3 mm in length). We attached a circular 
coverslip, matched in diameter to that of the capillary’s outer edge, to one end 
of the guide tube by using optical epoxy (Norland Optical Adhesive 81). We used 
1.0-mm-diameter micro-optical probes of diffraction-limited resolution (0.8 NA, 
250 jum working distance in water) that were encased in a 1.4-mm-diameter 
sheath". 

In vivo two-photon imaging. We used a modified commercial two-photon 
microscope (Prairie Technologies) equipped with a tuneable Ti:Sapphire laser 
(Chameleon, Coherent). We tuned the laser emission to 920 nm and adjusted 
the average illumination power at the sample (~5-25 mW) for consistency in 
signal strength across imaging sessions in each mouse. For microendoscopy we 
used a 20 X 0.8 NA objective (Zeiss, Plan-Apochromat) to deliver illumination 
into the microlenses. In some cases we imaged directly through the glass cannula 
using a Olympus LUMPlan FI/IR 0.8 NA X40 water immersion objective lens and 
confirmed that the optical resolution in all three spatial axes was identical between 
the two approaches. Beginning at 15-18 days after surgery, we imaged mice every 3 
days under isoflurane anaesthesia (1.5% in O,) for a total of 8-16 sessions each 
lasting 60-90 min. We imaged some mice at irregular intervals up to 80 days. 
MK801 treatment. In mice subject to the protocol of Fig. 2d, after the fourth 
imaging session we administered MK801 (Tocris Bioscience; 0.25 mg g ' body 
weight; dissolved in saline) in two intraperitoneal injections each day (8-10 h 
apart) as described previously”. 

Enriched environment. Animals given an enriched environment had a larger 
cage (42 (length) < 21.5 (width) X 21.5 (height) cm’) that contained a running 
wheel, objects of various colours, textures and shapes, plastic tunnels, and food 
with different flavours. We changed the objects, as well as their placements within 
the cage, every 3-4 days to encourage exploration and maintain novelty. We 
provided food and water ad libitum. 

Histology. At the end of in vivo experimentation, we deeply anaesthetized mice 
with ketamine (100 mg kg’) and xylazine (20 mg kg"). We then perfused PBS 
(pH 7.4) into the heart, followed by 4% paraformaldehyde in PBS. We fixed 
brains overnight at 4°C and prepared floating sections (50 mm) on a vibrating 
microtome (VT1000S, Leica). Before in vitro imaging of GFP fluorescence, we 
washed the fluorescent sections with PBS buffer several times and quenched them 
by incubation in 150 mM glycine in PBS for 15 min. After three washes in PBS, we 
mounted sections with Fluoromout-G (Southern Biotech). We inspected the 
sections using either two-photon fluorescence imaging or a STED microscope 
(Leica TCS STED CW, equipped with a Leica HCX PL APO 100 X 1.40 NA oil- 
immersion objective.). 

For immunostaining sections were washed with PBS buffer several times 
before quenching and permeabilization (15 min incubation in 0.1% Triton-X 
in PBS). Sections were incubated in blocking solution (1% Triton X-100, 2% 
BSA, 2% goat serum in PBS) for 4 h. Primary antibodies (rat anti-CD68, FA11- 
ab5344, Abcam, 1:100 dilution; mouse anti-GFAP, MAB3402, Millipore, 1:500 
dilution) were diluted in blocking solution and sections were incubated over- 
night in this solution. The following day sections were washed with PBS and 
incubated in diluted secondary antibody: Cy5-conjugated goat anti-mouse IgG, 
A10524 and Cy5-conjugated goat anti-rat IgG, A10525; both Molecular Probes; 
both 1:1,000 dilution, in blocking solution for 3 h. All staining procedures were 
done at room temperature. After three washes in PBS, sections were mounted 
with Fluoromout-G (Southern Biotech). Histological specimens were inspected 
on a confocal fluorescence microscope (Leica SP2 AOBS). 

Imaging sessions. We mounted the isoflurane-anaesthetized mice on a stereo- 
tactic frame. To perform microendoscopy we fully inserted the microendoscope 
probe into the guide tube such that the probe rested on the guide tube’s glass 
window. To attain precise and reliable three-dimensional alignments across all 
imaging sessions of the brain tissue undergoing imaging, we used a laser-based 
alignment method. We positioned a laser beam such that when the mouse’s 


LETTER 


head was properly aligned, the beam reflected off the back surface of the 
microendoscope and hit a designated target. This ensured that at each imaging 
session the long axis of the microendoscope was perpendicular to the optical 
table to within ~1 angular degree. Otherwise we inserted a drop of water in the 
cannula and imaged using the water immersion objective. As previously 
described", we experimentally confirmed that the resolution limits of the two 
approaches were essentially identical. 

Image acquisition. During the first imaging session, we selected several regions of 
brain tissue for longitudinal monitoring across the duration of the time-lapse 
experiment. Each of these regions contained between 1 and 7 dendritic segments 
visibly expressing GFP. In each imaging session, we acquired 6-8 image stacks of 
each selected regions using a voxel size of 0.0725 X 0.0725 X 0.628 pm’. 

Image pre-processing. To improve the visual saliency of fine details within each 
image stack, initial pre-processing of the image stacks involved a blind decon- 
volution based on an expectation-maximization routine (Autodeblur from 
Autoquant). We then aligned all individual images acquired at the same depth 
in tissue using the TurboReg plug-in routine for ImageJ. Finally, we averaged 
pixel intensities across the aligned stacks, yielding a single stack that we used in 
subsequent analyses. 

Scoring of dendritic spines. We scored spines using a custom MATLAB interface 
that supported manual labelling of spines using the computer mouse, measure- 
ments of dendrite length and spine position, and alignments of time-lapse sets of 
image stacks. For each region of tissue monitored, we loaded all the image stacks 
acquired across time, such that the temporal sequence of the stacks was preserved 
but the experimenter was blind to their dates of image acquisition during spine 
scoring. We excluded images whose quality was insufficient to score spines. 

We scored spines similarly to as described previously” but with a few modifica- 
tions. We labelled protrusions as dendritic spines only if they extended laterally 
from the dendritic shaft by >0.4 um (Extended Data Fig. 6a, b). We did not include 
protrusions of <0.4 um in the analysis (Extended Data Fig. 6c, d). When a spine first 
appeared in the time-lapse image data we assigned it a unique identity. We pre- 
served the spine’s identity across consecutive time points if the distances between 
the spine in question and two or three of its neighbouring spines were stable. In 
ambiguous cases, which were hardly the norm, we required stability to <2 Jum. 

The surviving fraction, S(t), at time f was defined as the fraction of spines 
present on the first imaging day that were also present a time f later. For the 
deliberate purpose of attaining conservative estimates of (for example, lower 
bounds on) the proportion of impermanent spines, in the calculation of S(t) we 
handled the 18% of spines in the recurrent-location category (Fig. 1d and 
Extended Data Fig. 1d, e) in the following way. When checking pairs of images 
acquired an interval t apart, we deliberately did not distinguish between whether 
the second image contained the original spine or its replacement spine at the same 
location. This approach thereby underestimated spine turnover as inferred from 
analyses of S(t), implying that our conclusion of CA1 spine impermanence is not 
only mathematically conservative but also robust to any scoring errors in which we 
might have erroneously missed a spine that had in fact persisted to subsequent 
imaging sessions. 

Turnover ratio was defined as the sum of spines gained and lost between two 
consecutive time points normalized by the total number of spines present at these 
time points. Spines lost or gained were defined as the number of spines lost or 
gained between two consecutive time points, respectively, normalized by the total 
number of spines present at these time points. 

To make coarse estimates of spine volumes, for stable spines we determined 

each spine’s fluorescence within a manually drawn region of interest (ROI) in the 
axial section in which the spine head appeared at its biggest diameter. We normal- 
ized this value by the fluorescence value attained by moving the ROI to within the 
nearby dendritic shaft (as in ref. 7). 
Statistical analysis. To test for differences in spine densities, turnover ratios 
and surviving fractions either over time or between different groups, we used 
non-parametric two-sided statistical testing (Wilcoxon signed-rank, Mann- 
Whitney U and Kruskal-Wallis ANOVA tests) to avoid assumptions of normality 
and Dunn-Sidak correction for multiple comparisons. Sample size was chosen to 
match published work’. 

To compare experimentally measured spine survival to the theoretical predic- 
tions from kinetic modelling, we assessed the goodness-of-fit for each model by 
using both the reduced chi-squared statistic and the log-likelihood function 
(Supplementary Methods). Both the mean and covariance of the surviving frac- 
tion depended on the model parameters and influenced the goodness-of-fit 
(Supplementary Methods). We also required that the parameter describing the 
minimal separation needed to resolve two spines was 0.5-1 jum (other values for 
this minimal separation are implausible) (Supplementary Methods). 

Simulated data sets. We modelled the microscope’s optics on the basis of prior 
measurements" and tuned the kinetics of spine turnover, spine geometries and 
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dendrite geometries to produce simulated image sequences that the data analyst 
judged to be similar to the actual data (Supplementary Information and Extended 
Data Fig. 5). In some data sets, we matched the simulated spine kinetics to those 
inferred from our in vivo measurements. 
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Extended Data Figure 1 | In vivo imaging of CA1 spine dynamics over 
extended time intervals. a, In vivo, 80-day-long time-lapse image data set 
sampled at variable intervals. Each image shown is the maximum projection of 
4-8 images acquired at adjacent z-planes. Scale bar, 2 |1m. b, c, Direct 
empirical determinations of spine density (b) and spine survival (c) across the 
80 days. A normalized spine density of one corresponds to a measured spine 
density of 1.16 1m — 1 Data points are mean = s.e.m. for 16 dendrites. d, In vivo, 
22-day-long time-lapse image data set sampled every 3 days. Over 50% of the 
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spines underwent visually noticeable dynamic changes. Pie chart shows the 
proportions of spines that were persistent or exhibited different patterns of 
turnover (” = 1,075 total spines from 4 mice). Colour coding is the same as in 
Fig. 1d. Error bars are s.e.m. for four mice. e, Histograms show the distributions, 
for each class of spines, of the fraction of imaging sessions in which each spine 
was observed within the same 22-day data set of d. Colour coding is the same as 
in Fig. 1d. Error bars represent s.d. estimated as counting errors. 
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Extended Data Figure 2 | The chronic CA1 preparation induces a minimal, 
5-um-thick layer of glial activation and does not affect spine density. a—c, In 
two groups of mice, each comprising two Thy1-GFP transgenic animals (8-10 
weeks old), we implanted the imaging guide tube just dorsal to hippocampal 
area CA] following established procedures*'*”’. a, b, We euthanized, sliced and 
stained the first group after two further weeks (a), and the second group 
after five further weeks (b). Confocal fluorescence images of the stained tissue 
slices revealed activated microglia (CD68 staining, top, red), astrocytes (GFAP 
staining, bottom, red), GFP-expression pyramidal neurons (green), and 
permitted quantifications of spine density in CA1 regions both ipsilateral and 
contralateral to the implant. a, Two weeks after implantation, confocal 
photomicrographs (maximum intensity projection of four separate z-planes, 
axially spaced 0.2 [1m apart) revealed a limited presence of activated microglia 
(red arrowheads indicate single cells) on the implanted hemisphere (top right) 
but were virtually non-existent in the contralateral control CA1 from the 
same mice (top left). Staining for astrocytes on the implanted hemisphere 
(bottom right) was almost indistinguishable from the control hemisphere 
(bottom left), except for a 5-10-t1m-thick layer of astrocyte label abutting the 
optical surface of the imaging guide tube. b, Five weeks after implantation, 
confocal photomicrographs (maximum intensity projection of four separate 
z-planes, axially spaced 0.2 1m apart) revealed an almost undetectable presence 
of activated microglia on the implanted hemisphere (top right), comparable to 
the contralateral control hemisphere (top left). Staining for astrocytes in the 
implanted hemisphere (bottom right) was almost indistinguishable from the 
control hemisphere (bottom left), except for a 5-10-j1m-thick layer of astrocyte 
label abutting the optical surface of the imaging guide tube. c, Mean density 
of spines on pyramidal cell basal dendrites in the implanted CA1 (white 
columns) was statistically indistinguishable from the contralateral control 
CA1 (black columns), at 2 weeks (P = 0.21; Mann-Whitney U-test; n = 11 
dendrites) and at five weeks (P = 0.98; Mann-Whitney U-test; n = 11 
dendrites) after implantation. Error bars are s.d. 
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Extended Data Figure 3 | Two-photon and STED imaging of the same CA1 
spines in fixed tissue reveals that nearby spines can merge in two-photon 
images. a-c, Example two-photon microendoscopy (a), STED (b), and overlay 
(c) images of the same CA1 basal dendrite, acquired in a fixed brain slice from a 
Thy1-GFP mouse. Nearby spines that are clearly distinguishable in the STED 
image (green arrowheads) but within the diffraction-resolution limit of two- 
photon microendoscopy (0.85 NA) appear as single, merged entities within the 
two-photon image (red arrowheads). The two-photon image shown is the 
maximum intensity projection of three optical sections axially spaced 0.6 1m 
apart. The STED image is the maximum intensity projection of six optical 
sections spaced 0.3 tum apart. Scale, 2 jim. d, e, To attain an approximate 
measure of spine volume, we quantified each spine’s fluorescence in manually 
drawn regions of interest (ROIs) and normalized it by the fluorescence value 
in the nearby dendritic shaft, within an ROI of identical shape and size within a 
single axial section of the two-photon image stack. To ascertain whether each 
of the spines scored in the two-photon images was actually a merged spine or 
not, we consulted the STED images of the same dendrite. Plotted are the 
normalized fluorescence values (d) for unitary spines as well as doublet and 
triplet merged spines (black lines: mean values + s.d.; coloured points: 

data from individual spines), and the cumulative distributions of these 
measurements (e). The distributions of normalized fluorescence were statis- 
tically indistinguishable (P > 0.06; Kruskal-Wallis ANOVA; N = 100, 30 
and 7, respectively, for unitary, doublet and triplet spines), probably reflecting 
the substantial range of CA1 spine geometries (Extended Data Fig. 9). 
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Extended Data Figure 4 | The asymptotic value of the surviving fraction of 
spines exceeds the fraction of permanent spines. a, Surviving fraction curves 
for models in which the fraction of permanently stable spines is y = 0 (red 
curve) and y = 0.3 (purple curve) (Supplementary Methods). The timescale of 
spine survival was t = 1, and the filling fraction value was f = 0.2. The surviving 
fraction asymptotes to a value, S”, that encodes the fraction of stable spines. 
(Supplementary Information has a list of all mathematical variables used in this 
work, and their definitions). b, The time asymptotic value of the surviving 
fraction (green curve) exceeds the fraction of stable spines (dashed blue line). 
Coloured circles correspond to the surviving fraction curves plotted in a. 
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Extended Data Figure 5 | Examples of simulated imaging data sets and their 
scoring. a, Example simulated images (Supplementary Methods) of dendrites 
for which the spine density was 0.6 um! (top), 1.5 pm™! (middle) and 

2.4 um! (bottom). b, Ground truth and manual scoring of a simulated 
dendrite for which the spine density was 1.5 jum’. Green arrowheads indicate 
counting errors originating from the optical merging of spines (Supplementary 
Methods). Blue arrowheads indicate counting errors that occur when the 
spine’s projection into the optical plane is too short (Supplementary Methods). 
Scale bar, 2 um. ¢, The visually scored surviving fraction (red circles) differed 
from the true spine surviving fraction (blue triangles), but the departures were 
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well predicted by the kinetic model (solid black curve) (Supplementary 
Methods). d, We simulated and scored a long-term lapse imaging data set with 
kinetic parameters that matched the best-fit model of Fig. 4b (Supplementary 
Methods). Even though the data set lacked stable spines, many simulated 
spines appeared to persist for long time intervals. Scale bar, 2 jim. e, Although 
the spine density in the simulated data was 2.56 m7‘, visual scoring yielded 
a lower spine density. We used the measured and true spine densities to 
estimate the extent of merging and the counting resolution (Supplementary 
Methods). Data points are mean + s.e.m for 20 simulated dendrites (c) or 10 
simulated dendrites (e). 
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Extended Data Figure 6 | Variability of imaging angles has virtually no 
impact on determinations of spine turnover. a—-g, We examined empirically 
whether variations in dendritic angle across different imaging sessions might 
impact determinations of spine turnover. However, the variations in dendritic 
angle that were actually present in our data sets were insufficient to cause 
illusory turnover. a, b, Dendritic spines can be detected when the angle (0) 
between a spine and the normal vector is large (Supplementary Methods). View 
of a dendrite and spine in the (x, z) (a) and optical (x, y) (b) planes. 

c, d, Dendritic spines cannot be detected when the angle between a spine and 
the normal vector () is small (Supplementary Methods). View of a dendrite 
and spine in the (x, z) (c) and optical (x, y) (d), planes. e, For every dendrite and 
time point, we estimated the dendrite’s angle with respect to the optical plane 
using the three-dimensional coordinates of two manually labelled points on the 
dendrite chosen to flank the region of dendrite containing the scored spines. 
Over time, individual dendrites varied about their initial angle (n = 55 
dendrites tracked over 16 sessions; data set of Fig. 3d). f, Distribution of the 
fluctuations in angle, pooled across the 55 dendrites, relative to the initial angle 
as seen in the first imaging session. The average magnitude of an angular 
fluctuation was 4.5°, and 90% of angular fluctuations were <10° in magnitude. 
Thus, a 5° fluctuation was typical in our data set, whereas a 10° fluctuation was 
atypically large. g, To determine if variability in the imaging angle might impact 
determinations of spine turnover, we imaged 18 dendrites in fixed slices 
while deliberately tilting the imaging plane by 0°, 5° and 10°. We made a total of 
989 spine observations. Over 95% of spines scored in the 0° condition were 
also correctly scored when the specimen was tilted by 5° or 10°. Overall, the 
level of angular fluctuations in the in vivo imaging data has virtually no impact 
on turnover scores. 
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Extended Data Figure 7 | Kinetic modelling well describes how optical 
merging affects spine turnover dynamics as monitored with finite optical 
resolution. a, Diagram of the kinetic scheme used to describe merged spine 
dynamics (Supplementary Methods). Each state is labelled by the number of 
actual spines that have merged in appearance to a single spine (a quantity that 
we call the merged spine order; Supplementary Methods). State ‘0’ indicates the 
absence of a spine, and state ‘1’ indicates a spine that is truly unitary. 
Transitions occur between adjacent states in the kinetic ladder diagram with 
rate constants ry. b, The rate constants governing increases in merged spine 
order depend on two parameters (Supplementary Methods): (1) the initial state 
or merged spine order; and (2) the overall degree of merging in the spine image 
data set, which is proportional to the product of the spine density and the 
shortest resolvable interspine interval (denoted L) (Supplementary Methods). 
By contrast, the rate constants governing decreases in merged spine order 
(inset) depend only on the initial merged spine order (Supplementary 
Methods). ¢, In the case when all spines are labile, a collapsed kinetic scheme in 
which a single state (Y) combines all merged spine orders above zero 
approximates the complete model (d) and can be solved mathematically 
(Supplementary Methods). d, The surviving fraction curve generated from the 
collapsed kinetic scheme (blue curve) fits the empirically observed surviving 
fraction (black data points) as well as the best-fit model (red). e, The asymptotic 
value of the surviving fraction is a function of the degree of merging 
(Supplementary Methods). A large degree of merging (as in CA1, blue circle) 
produces a larger asymptotic value of the merged spine surviving fraction 
than a small degree of merging (as in the neocortex, purple circle). f, The 
estimated lifetime of merged spines is a function of the degree of merging 
(Supplementary Methods). A large degree of merging (as in the hippocampal 
CAI, blue circle) produces a longer relative lifetime of merged spines than a 
small degree of merging (as in the neocortex, purple circle). 
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Extended Data Figure 8 | Dynamic spine geometries induce modest levels 
of apparent spine turnover that cannot explain the turnover measured in 
vivo. a—f, To study potential effects of fluctuations in spine geometry, we used 
values for the means and variances of dendrite radius, spine length and spine 
radius that were determined by electron microscopy'”**. We then computa- 
tionally examined how time-dependent fluctuations in these parameters would 
affect determinations of spine surviving fraction (Supplementary Methods). 
a, We examined how fluctuations in dendrite radius, spine radius, spine length 
and spine angle—individually (coloured data points) and all together (black 
data)—affect the spine surviving fraction when the fluctuating geometric 
parameters are chosen stochastically in each of two imaging sessions, as a 
function of the parameter’s time correlation between the two sessions 
(Supplementary Methods). As expected, when the two sessions involved image 
pairs that were perfectly correlated, the surviving fraction reached 100%. 
Fluctuations in all four parameters had greater effects than fluctuations in 
individual geometric parameters. b, To estimate the time dependence of the 
surviving fraction of scorable spines from a, we assumed all geometric 
parameters evolved according to the time-correlation function that we 
empirically determined from in vivo imaging data (Extended Data Fig. 9d). 

c, The apparent surviving fraction is the product of the true surviving fraction 
and the surviving fraction of scorable spines (Supplementary Methods). For 
the best-fit kinetic model, the apparent surviving fraction is very close to the 
true surviving fraction. d, The difference between the fitted timescale of the 
apparent surviving fraction and the true survival timescale is small across the 
range of model parameters consistent with the in vivo data (Supplementary 
Methods). e, The graph plots the lower bound of the surviving fraction of 
scorable merged spines as a function of the time-correlation function shared by 
all four geometric parameters, for different merged spine orders. As this 
lower bound increases rapidly with the merged spine order, artefactual 
turnover due to unscorable spines is unlikely when spine merging is common 
(Supplementary Methods). f, We combined Fig. 4c and Extended Data 

Fig. 8e to bound the turnover that could result from unscorable spines 
(Supplementary Methods). As the empirically measured surviving fraction 
falls below the lower bound obtained for the surviving fraction of scorable 
merged spines, ongoing changes in the geometric parameters of spines cannot 
account for the observed spine turnover. 
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Extended Data Figure 9 | Dynamics of spine geometries measured in vivo. 
a, Time courses of the spine length, measured from the border of the dendritic 
shaft to the centre of the spine, for five example spines tracked over eight 
imaging sessions (left). Distribution of spine lengths (right; n = 344 spine 
observations). b, Time courses of the spine radius, measured from the border to 
the centre of the spine, for five example spines tracked over eight imaging 
sessions (left). Distribution of spine radii (right; n = 344 spine observations). 
c, Time courses of the dendritic radius, measured from the border to the centre 
of the dendrite, at the location of five example spines tracked over eight imaging 
sessions (left). Distribution of all dendritic radii (right; n = 344 dendrite 
observations). d, Experimental spine length time-correlation function and its 
exponential fit (Supplementary Methods). 
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Extended Data Figure 10 | Volumes of stable spines fluctuate minimally 
over time. a-d, To attain an approximate measure of spine volume for stable 
spines, we quantified each spine’s fluorescence in manually drawn regions of 
interest (ROIs) and normalized it by the fluorescence value in the nearby 
dendritic shaft, as determined within a ROI of identical shape and size in a 
single z-section image acquired by two-photon microendoscopy. In addition, 
each spine’s fluorescence value at each time point is normalized to its own mean 
over the entire experiment. a, b, Mean (+ s.e.m.) fluorescence intensities of all 
spines (a), and individual spines (b), from a set of 43 stable spines, across a 


21-day span during which mice (n = 4) were in their home cages (same data as 
for Fig. 3a). Dashed black line in a indicates the mean over all imaging sessions. 
c, The correlation functions of spine radius (green), length (blue), fluorescence 
(black) and dendritic radius (red) are indistinguishable from each other. 

d, Mean (+ s.e.m.) fluorescence intensities of 61 stable spines across a 46-day 
span during which mice (n = 3) initially resided in their home cages (black 
data points) but later moved to an enriched environment (red points) (same 
data set as for Fig. 3d). Dashed black and red lines respectively denote the mean 
values over the baseline and enriched periods. 
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Stem cells integrate inputs from multiple sources. Stem cell niches 
provide signals that promote stem cell maintenance’, while dif- 
ferentiated daughter cells are known to provide feedback signals 
to regulate stem cell replication and differentiation* *. Recently, 
stem cells have been shown to regulate themselves using an auto- 
crine mechanism’. The existence of a ‘stem cell niche’ was first 
postulated by Schofield in 1978 to define local environments 
necessary for the maintenance of haematopoietic stem cells’. 
Since then, an increasing body of work has focused on defining 
stem cell niches’*. Yet little is known about how progenitor cell 
and differentiated cell numbers and proportions are maintained. 
In the airway epithelium, basal cells function as stem/progenitor 
cells that can both self-renew and produce differentiated secretory 
cells and ciliated cells*°. Secretory cells also act as transit-amp- 
lifying cells that eventually differentiate into post-mitotic ciliated 
cells”"° . Here we describe a mode of cell regulation in which adult 
mammnalian stem/progenitor cells relay a forward signal to their 
own progeny. Surprisingly, this forward signal is shown to be 
necessary for daughter cell maintenance. Using a combination 
of cell ablation, lineage tracing and signalling pathway modu- 
lation, we show that airway basal stem/progenitor cells continu- 
ously supply a Notch ligand to their daughter secretory cells. 
Without these forward signals, the secretory progenitor cell pool 
fails to be maintained and secretory cells execute a terminal dif- 
ferentiation program and convert into ciliated cells. Thus, a 
parent stem/progenitor cell can serve as a functional daughter 
cell niche. 

To establish whether post-mitotic ciliated cells send a conventional 
feedback signal to regulate the replication of their parent stem and 
progenitor cells, we genetically ablated ciliated cells using FOXJ1- 
creER; LSL-DTA (Rosa26R-DTA) mice (hereafter referred to as 
FOXJ1-DTA) (Fig. 1a). Following ciliated cell ablation, the absolute 
numbers and morphology of secretory progenitor cells (SCGB1A1") 
and basal stem/progenitor cells (CK5") remained unchanged despite 
the ablation of 78.8% of ciliated cells (on day 5, a total of 24.29 + 0.3% of 
all epithelial cells in control mice (identified with the nuclear marker 
4',6-diamidino-2-phenylindole, DAPI*) were FOXJ1* ciliated cells 
versus 5.13 + 0.4% in tamoxifen-treated mice (n = 3 mice)) (Fig. 1b, 
c and Extended Data Fig. 2a, b). Surprisingly, we did not observe the 
anticipated increase in stem or progenitor cell proliferation and/or their 
differentiation to replenish missing ciliated cells (Extended Data Fig. 
2c-e). Even over extended periods of time, the rates of epithelial pro- 
liferation remained similar to those of uninjured controls (Extended 
Data Fig. 2d). The number of ciliated cells increased at a rate that 
corresponds to the normal rate of ciliated cell turnover (Fig. 1d). 
Following ciliated cell ablation, ciliated cell turnover occurs with a 
half-life of 149 days (Fig. le) which mirrors the reported steady-state 
half-life of approximately 6 months"’. Additionally, the mesenchymal, 


haematopoietic, endothelial and smooth muscle cell populations 
appeared unchanged (Extended Data Fig. 2f, g). 

Lacking evidence to support the presence of a feedback mechanism 
to restore ciliated cell numbers after ablation, we wondered whether 
basal stem/progenitor cells might regulate secretory daughter cell 
behaviour by regulating the differentiation of secretory cells into cili- 
ated cells. Thus, we ablated basal cells and simultaneously traced 
the lineage of secretory progenitor cells using Scgblal-creER;LSL- 
YFP;CK5-rtTA;tet(O)DTA mice (hereafter referred to as SCGB1A1- 
YFP;CK5-DTA), as previously described’* (Fig. 1f). In addition to 
the dedifferentiation of secretory cells we previously described follow- 
ing stem cell ablation’*, we observed an increase in lineage-labelled 
yellow fluorescent protein (YFP*) cells expressing the ciliated cell 
marker FOXJ1 (8.1 + 1.6% of YEP* cells were FOXJ1~ in controls 
versus 42.4 + 1.0% in experimental animals) and an accompanying 
decrease in YFP’ SCGBIAI™ secretory cells (88.5 + 4% versus 
45 + 3%) (n = 3 mice) (Fig. 1g, h). We again observed that ~8% of 
lineage-labelled secretory cells dedifferentiated into basal cells as prev- 
iously described’’. Thus, we can now account for the fates of all lineage- 
labelled secretory cells after stem cell ablation, as the decrement in 
secretory cell lineage label (43.5%) is almost precisely equal to the 
combined increase in lineage-labelled ciliated and basal cells (34% 
and 8%, respectively). Importantly, lineage-labelled ciliated cells 
expressed c-MYB, a transcription factor required for ciliogenesis’*"* 
and acetylated tubulin (AcTub) confirming that secretory cells differ- 
entiated into mature ciliated cells (Extended Data Fig. 3a, b). These 
results were confirmed by flow cytometry (Extended Data Fig. 3c). In 
contrast to the changes in the tracheal epithelium in which the total 
number of ciliated cells increased twofold (625 + 29 versus 1,208 + 93 
ciliated cells, representing 24.5 + 1.5% and 61 + 4.7% of total cells, 
respectively) (Extended Data Fig. 3d), the underlying mesenchyme 
remained unchanged in morphology and its complement of haemato- 
poietic, endothelial and smooth muscle cells (Extended Data Fig. 3e, f). 

As the Notch pathway has been shown to regulate ciliated versus 
secretory cell fate choices in the embryonic lung and regenerating adult 
airway epithelium’*”°, we assessed the expression of Notch pathway 
components in each cell type of the adult homeostatic airway epithe- 
lium. Quantitative real time PCR analysis on purified airway epithelial 
cells revealed that the Notch1 receptor was highly expressed in basal 
stem/progenitor cells as previously reported'’, Notch2 and Notch3 
were significantly enriched in secretory progenitor cells, and Notch4 
was not detected (n = 3 mice) (Fig. 2a and Extended Data Fig. 4a). 

Signalling through the Notch2 receptor has previously been postu- 
lated to regulate secretory cell fate in the embryonic lung”, in inflam- 
matory cytokine-induced goblet cell metaplasia”’, and we have found it 
to be activated during secretory cell fate commitment during regen- 
eration”’. We found that steady-state nuclear Notch2 intracellular 
domain (N2ICD) expression was restricted to secretory progenitor 
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Figure 1 | Secretory progenitor cells differentiate into ciliated cells 
following basal stem/progenitor cell ablation. a, Schematic representation of 
ciliated cell ablation. Ciliated, secretory and basal cells are shown in blue, 
pink and grey, respectively. b, Immunostaining for SCGB1A1 (green), FOXJ1 
(red) and CK5 (cyan) on control (top) or tamoxifen (Tam)-treated FOXJ1- 
DTA mice (bottom) (n = 6 mice). ¢, Absolute cell number of each cell type in 
both groups (n = 3 mice). d, Percentage of FOXJ1" cells per total DAPI* 
cells over time (n = 3 mice). NS, not significant when compared to day 0 of the 
same group. e, Percentage of FOXJ1” cells in tamoxifen treated mice (n = 3 
mice). f, Schematic representation of secretory cell lineage labelling and 

basal cell ablation. g, Immunostaining for FOXJ1 (red), YFP (green) and CK5 
(cyan) on inhaled (i)-PBS (top) or i-Dox (bottom) treated SCGB1A1-YFP; 
CK5-DTA mice (n = 3 mice). White arrowheads, lineage-labelled ciliated cells. 
h, Percentage of SCGBIA1* and FOXJ1* cells per total YFP* cells. Nuclei, 
DAPI (4’,6-diamidino-2-phenylindole, blue). n = biological replicates/ 
condition repeated twice (two independent experiments). **P < 0.01, 

***P < 0,001. Error bars, means + s.e.m. Scale bars, 20 tum. 


cells (92.7 + 8% of N2ICD™ cells were secretory cells. n = 3 mice), 
whereas negligible amounts of N2ICD were detected in basal stem/ 
progenitor cells (1.5 + 3%) and none was seen in ciliated cells (Fig. 2a-e). 
Consistently, 85.1+5.9% of SSEA-1* cells and 93.7 +2.1% of 
SCGB1A1* cells demonstrated active N2ICD expression (Fig. 2b-e). 
To confirm these observations, we stained airway sections from B1- 
eGFP mice (in which enhanced GFP (eGFP) is expressed exclusively 
in secretory cells)'*”, and found that 92.6 + 2.2% of eGFP* cells co- 
expressed N2ICD (Fig. 2f, g). Unlike the cell specificity associated 
with N2ICD, we found activated Notchl (N1ICD) was expressed in 
most basal stem/progenitor cells and secretory progenitors (Extended 
Data Fig. 4b). Active N3ICD was detected in subsets of basal, secretory 
and ciliated cells (Extended Data Fig. 4c). Additionally, the Notch 
target genes Heyl and HeyL were enriched in secretory progenitor 
cells (Extended Data Fig. 4d). 

To test directly whether sustained tonic Notch activation is required 
to maintain secretory cell fate, we abrogated Notch signalling in these 
cells using Scgb1al-creER; LSL-YFP; RBPjk" mice (hereafter referred 
to as SCGB1A1-RBPjk™"). The efficient deletion of RBPjk, an essential 
transcription factor required for canonical Notch signalling”, was 
confirmed (Extended Data Fig. 5a—-c). As a consequence of RBPjk 
deletion, the Notch target genes Hes1 and HeyL were downregulated 
(Extended Data Fig. 5c). There is a population (approximately 20%) of 
YFP* secretory cells in which RBPjk deletion has not occurred 
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Figure 2 | Secretory progenitor cells show tonic Notch2 activity at steady 
state. a, Schematic representation of airway epithelial cell isolation. 
Relative messenger RNA expression of Notch2 in sorted cells (n = 3 mice) 
(middle). Percentage of each cell type per total N2ICD* cells (right). 

b-e, Immunostaining for p63 (left) or FOXJ1 (right) (green), SSEA-1 (b) or 
SCGB1A1 (d) (cyan) and N2ICD (red). Percentage of N2ICD* cells per total 
SSEA-1* (c) or SCGBIA1* (e) cells (n = 3 mice). f, Immunostaining for 
eGFP (green) and N2ICD (red) in B1-eGFP mice. g, Percentage of N2ICD* 
cells per total eGFP* cells (n = 3 mice). Nuclei, DAPI (blue). White 
arrowheads, double-positive cells. Images are representative of n = 3 mice 
(biological replicates). *P < 0.05, *** P < 0.001. Error bars, means + s.e.m. 
Scale bars, 20 tm. 


(yellow arrows in Extended Data Fig. 5a), accounting for the residual 
RBPjk message (Extended Data Fig. 5c). We assessed the fate of 
lineage-labelled secretory cells following RBPjk loss (Fig. 3a) and 
found that YFP* cells were less likely to express secretory cell markers 
SCGB1A1 (94.4 + 0.9% versus 31.3 + 2.2% of YFP™ cells), SCGB3A2 
(93.6 + 1.2% versus 25.7 + 2.3%) and SSEA-1 (90+ 1.7% versus 
23.5 + 1%) at the protein level, and were more likely to express the 
ciliated cell proteins FOXJ1 (5.1 + 0.6% versus 68.2 + 3.1%), acety- 
lated tubulin (AcTub) (7.4 + 1.3% versus 70.6 + 3.8%) and c-MYB 
(n = 6 mice) (Fig. 3b, c and Extended Data Fig. 5d, e). A decrease in 
the expression of the secretory cell-specific genes Scgblal and Scgb3a2 
and an increase in the expression of the ciliated cell genes FoxJ1 and 
c-myb in lineage-labelled YFP* cells was also observed (n = 3 mice) 
(Fig. 3d). Similarly, secretory cells that had undergone recombination 
and lost RBPjk concomitantly lost their characteristic N2ICD express- 
ion as they switched fate into FOXJ1 * ciliated cells (Fig. 3e). Less than 
0.1% of YFP" cells co-expressed CK5, suggesting that the lack of Notch 
signalling in secretory cells is not responsible for the dedifferentiation 
of secretory cells into basal cells that we previously described following 
basal cell ablation’* (Extended Data Fig. 5f, g). The cell fate changes 
described above were confirmed by flow cytometry (Extended Data 
Fig. 5h, i) and the phenotype persisted over time (Extended Data 
Fig. 6a-e). Moreover, overall airway cell proliferation and apoptosis 
were not affected by RBPjk loss (Extended Data Fig. 6f-k). RBPjk loss 
induced the direct differentiation of secretory cells into ciliated cells in 
the absence of proliferation as only 1.7 + 1.1% of all FOXJ1* cells had 
incorporated 5-bromodeoxyuridine (BrdU) over the course of the 
experiment (Extended Data Fig. 6f) and not a single BrdU’ YFP* 
FOXJ1” ciliated cell was found following continuous BrdU admin- 
istration (Extended Data Fig. 6h, i). Together these results suggest that 
tonic canonical Notch activity in secretory progenitor cells is necessary 
for their continued maintenance at steady-state, and that Notch acts by 
preventing the differentiation of the secretory progenitor cell pool into 
the terminally differentiated post-mitotic ciliated cell pool. 

To determine whether secretory-cell-specific N2ICD transduces a 
putative basal cell signal that is required for the maintenance of the 
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Figure 3 | Tonic Notch2 activity is required to 
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secretory cell pool, we deleted Notch2 from secretory cells using 
Scgb1 al-creER;LSL-YFP:Notch2" “fl mice (hereafter referred to as 
SCGB1A1-Notch2") (Fig. 3a). We first confirmed the efficient dele- 
tion of Notch2 and the downregulation of Hes1 and HeyL (Extended 
Data Fig. 7a-d). Upon Notch2 deletion, we observed that lineage- 
labelled cells ceased to express the secretory cell markers SCGB1A1 
(95.6+ 1.5% versus 23.8 + 3%), SCGB3A2 (90.84 1.3% versus 
6.8 = 1%) and SSEA-1 (88.2 + 2.8% versus 22.7 + 1%) and acquired 
the expression of the ciliated cell markers FOXJ1 (5.7 = 2.1% versus 
78 + 0.7%), AcTub (3.7£1.9% versus 57.66+6%) and c-MYB 
(5.6+0.4% versus 84.542.3%) (n=7 mice) (Fig. 3f, g and 
Extended Data Fig. 7e, f). The expression of secretory cell genes 
(Scgblal and Scgb3a2) was consistently downregulated in lineage- 
labelled cells, while ciliated cell genes (FoxJ1 and c-myb) were upregu- 
lated (n = 3 mice) (Fig. 3h). Intriguingly, YFP staining was present in 
the actual cilia of lineage-labelled cells, consistent with the terminal 
differentiation of secretory cells into mature ciliated cells (Fig. 3f, i). 
Flow cytometry analysis confirmed these cell fate transitions 
(Extended Data Fig. 7g, h) and also confirmed a lack of dedifferentia- 
tion of secretory cells into basal stem cells following Notch pathway 
modulation (Extended Data Fig. 7i, j). The observation that N2ICD 
and FOXJ1 expression remained mutually exclusive following Notch2 
deletion also suggested a largely completed cell fate transition (Fig. 3i). 
However, very rarely, YFP" cells expressing both markers were 
observed, leading one to speculate that these rare cells are transient 
cells caught in the process of differentiating from a secretory cell into a 
ciliated cell (Extended Data Fig. 8a). Similarly, rare lineage-labelled 
cells also co-express SSEA-1 and FOXJ1 (Extended Data Fig. 8b). 
Furthermore, following Notch2 elimination, Ki67 and BrdU incorp- 
oration and rates of apoptosis remained unchanged (Extended Data 


Scgb1a1 Scgb3a2 SSEA1 FoxJ1 AcTub 


Scgb3a2, FoxJ1 and c-myb in control and 
experimental YFP™ cells (n = 3 mice). 

e, Immunostaining for RBPjk (cyan), N2ICD (red) 
and FOXJ1 (green). White arrowheads, 

RBPjk N2ICD~ FOXJ1* cells. i, Immuno- 
staining for YFP (green), FOXJ1 (cyan) and N2ICD 
(red). White arrowheads, FOXJ1* cells. Yellow 
arrows, N2ICD* cells. White arrows, actual cilia in 
lineage-labelled cells. Nuclei, DAPI (blue). 

n = biological replicates/condition repeated three 
times (three independent experiments). 

***D < 0.001; error bars, means + s.e.m. Scale 
bars, 20 um. 
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Fig. 8c-g). Additionally, secretory cells directly differentiated into 
ciliated cells in the absence of proliferation since an insignificant 
(1.4 + 1.7%) percentage of FOXJ1~ cells were BrdU* following con- 
tinuous BrdU administration (Extended Data Fig. 8d, e). Together 
these data demonstrate that tonic Notch2 activity within secretory cells 
is required for the maintenance of secretory cells. Based upon the 
results of the basal cell ablation, we speculated that the Notch signal- 
sending cells are basal stem/progenitor cells. 

Consistent with prior studies®*'®'***, we found that Dill and Jag2 
were expressed in basal stem/progenitor cells, while Jag] was enriched 
in ciliated cells (Fig. 4a), and D/l3 and Dil4 were undetectable (data 
not shown). To remove the putative Notch signal arising from basal 
stem/progenitor cells, we deleted Mindbomb1 (Mib1) which is an 
E3 ubiquitin ligase required for the normal endocytic processing 
of all Notch ligands* in basal cells using CK5-rtTA; tet(O) 
Cre; Mindbombi™" mice (hereafter referred to as CK5-Mib1") 
(Fig. 4b). Upon efficient removal of Mib1 (93.3 + 3.8% of basal cells) 
(Extended Data Fig. 9a, b), a decrease in SCGB1A1* (42.8 + 0.9% 
versus 26.2 + 1.0%), SCGB3A2* (44.6 + 6.6% versus 6.2 + 0.7%) and 
SSEA-1* secretory cells (49.2 + 2.6% versus 24.7 + 1.1%) was accom- 
panied by an increase in FOXJ1* (30.1 + 0.9% versus 36.1 + 1.0%), 
AcTub™ (21.7 + 0.7% versus 24.8 + 0.7%), and c-MYB" ciliated cells 
(30.8 + 2.9% versus 56.2 + 8.0%) (nm = 4 mice) (Fig. 4c, d and Extended 
Data Fig. 9c, d). A corresponding significant decrease in the percentage 
of N2ICD* secretory cells was observed (43 + 1.7% versus 29.6 + 0.8% 
of total epithelial cells) (Fig. 4e, f), confirming that Notch ligands 
emanating from stem cells are necessary for N2ICD activity in secretory 
cells. These results were confirmed by flow cytometry which addition- 
ally revealed that there were no changes in the abundance of basal cells 

(Extended Data Fig. 9e, f). Rates of proliferation and apoptosis were 
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also unchanged (Extended Data Fig. 9g-l) and a negligible amount 
(0.77 + 1.5%) of FOXJ1* cells were found to incorporate BrdU after 
continuous BrdU administration (Extended Data Fig. 9i, j). In addition, 
the cell fate changes described above continued to be present 5 weeks 
after Mib1 deletion (Extended Data Fig. 9m). 

These results are consistent with the model that basal stem/progen- 
itor cells send an essential signal to secretory progenitor cells, and this 
signal is necessary for the maintenance of the appropriate balance of 
cell types in the airway epithelium. As Jag2 is the most abundantly 
expressed ligand in basal stem cells (Fig. 4a), we knocked down Jag2 
expression in vitro using short hairpin RNA (shRNA) lentiviral vectors 
(Extended Data Fig. 10a—c). This resulted in a decrease in Scgblal and 
Scgb3a2 expression and an increase in FoxJ1 and c-myb expression 
(Extended Data Fig. 10d), resembling the effects of in vivo Notch 
signalling disruption. To confirm that Jag2 is the signal emanating 
from basal stem/progenitor cells, we generated CK5-creER; LSL- YFP; 
Jagged2"" mice (hereafter referred to as CK5-Jag2") to genetically 
remove Jag2 from basal stem/progenitor cells in vivo (Fig. 4a). Jag2 
deletion was confirmed (Extended Data Fig. 10e) and although the 
efficiency of recombination as judged by the number of YFP* recom- 
bined cells was approximately 10% (Extended Data Fig. 10f), the dele- 
tion caused a striking decrease in N2ICD* suprabasal cells (43 + 6.6% 
versus 17 + 4.5% of total airway epithelial cells) (Fig. 4g, h) confirming 
that Jag2 is the basal cell signal responsible for activating N2ICD in 
secretory cells. We observed a consistent decrease in SCGBIA1* 
(63 + 2.1% versus 44.4 + 3.3%), SCGB3A2* (55 + 7% versus 17.5 + 
0.5%) and SSEA-1* secretory cells (42.8 + 2% versus 21.8 + 2%) anda 
concomitant increase in FOXJ1* (31.3 + 3.6% versus 46.6 + 2.2%), 
AcTub* (21.7 + 2.1% versus 46.2 + 3.9%) and c-MYB* ciliated cells 
(28.2 + 2.1% versus 49.6+ 11.3%) (n=5 mice) (Fig. 4i, j and 
Extended Data Fig. 10g, h). Results were also confirmed by flow cyto- 
metry (Extended Data Fig. 10i, j). Furthermore, we found no difference 
in the percentage of p63* (also known as Trp63) basal cells (Extended 
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AcTub* and c-MYB* cells in control and 
experimental mice. n = 4 mice (d); n = 5 mice 

(j). e, g, Immunostaining for N2ICD (red) and YFP 
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(f); n = 5 mice (h)). i, Immunostaining for YFP 
(green) and SCGB1A1 (left) or FOXJ1 (right) (red) 
in control (top) and experimental CK5-Jag2/" 
mice (bottom) (n = 5). Nuclei, DAPI (blue). 

n = biological replicates/condition repeated twice 
(CK5-Mib1 mice) or three times (CK5-Jag2 mice). 
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Data Fig. 10k, 1). N2ICD and FOXJ1 expression was mutually exclus- 
ive, consistent with a completed cell fate transition (Extended Data 
Fig. 10m), and there were no differences in overall proliferation and 
apoptosis (Extended Data Fig. 10n-r). 

Together, our results show that basal stem/progenitor cells regulate 
the maintenance of their own progeny through a mechanism in which 
basal-stem-cell-produced Jag2 activates Notch2 in daughter secretory 
progenitor cells to prevent secretory cell differentiation into post- 
mitotic ciliated cells (Extended Data Fig. 1). 

Schofield first introduced the term niche to make sense of 
experimental evidence that suggested the presence of local environ- 
ments necessary for the maintenance of haematopoietic stem cells’. 
However, he was explicit in referring to stem cell niches. We now show 
that stem/progenitor cells themselves serve as ‘daughter cell niches’ 
(Extended Data Fig. 1c). We would like to suggest that reciprocal forms 
of niche-type regulation may be a general feature of many tissues in 
which stem, progenitor and differentiated cells might all regulate the 
maintenance of one another. 

To serve as a progenitor cell niche, airway stem/progenitor cells use 
a ‘forward signal’ sent to their own progeny. We define a forward signal 
as a signal that is relayed from a parent cell to its daughter cell. 
Interestingly, in parallel to our mammalian example, in the fly midgut, 
a forward Notch signal is sent from an intestinal stem cell to alter the 
fate choice of its own downstream progeny’. However, from one 
setting to the next, Notch, with its myriad receptors and ligands, will 
inevitably be deployed in very divergent ways, even within the same 
tissue***’”?®. For example, following injury, airway basal stem/pro- 
genitor cells use a mechanism akin to lateral inhibition to segregate 
their lineages”, whereas pan-epithelial Jag2 deletion alters the distri- 
bution of airway progenitors in the embryonic airway epithelium, and 
in this context Notch3 is thought to be the relevant receptor”*. Notably, 
we identify Notch2 as the receiving receptor on secretory cells. N2ICD 
is, to the best of our knowledge, the first transcription factor that has 


©2015 Macmillan Publishers Limited. All rights reserved 


been found to be specific to steady-state adult airway secretory pro- 
genitor cells. 

More generally, we note that differentiated cells are commonly 
thought to send back signals to their respective stem and progenitor 
cells to regulate their proliferation and differentiation®*. This process 
is generally termed feedback regulation, and we were surprised not to 
see evidence of such a regulatory mechanism following ciliated cell 
ablation. More recently, self signals have been identified that mediate 
autocrine stem cell regulation’. As we demonstrate the existence of a 
forward signal, we would like to suggest that ‘forward regulation’ by 
stem cells is likely to exist (Extended Data Fig. 1d). Although it is 
tempting to call this form of regulation ‘feed-forward regulation’ to 
contrast it with ‘feedback regulation’, this term has been used in con- 
trol theory to denote a more complex form of regulation that involves 
three discrete entities that interact in a loop”*”*. Therefore, we opt to 
propose the simpler term ‘forward regulation’. To illustrate what we 
intend to suggest, we note that Notch signals in fly intestinal stem 
cells occur at varying levels of Notch activation that in turn determine 
daughter cell fate*°. Thus, the regulation of these forward Notch 
signals could be used to alter the distribution and ratio of daughter 
cell types. In our case, perhaps fluctuations in basal cell ligand levels 
determine the rate of ciliated cell turnover? And how would such 
forward signals be modulated following tissue injury? A recent study 
points to Notch2 as a receptor relevant to human asthma”’. Perhaps 
increasing basal cell ligand concentration is a mechanism used to 
engender the asthmatic epithelial phenotype in which secretory 
daughter cells differentiate into mucous-secreting goblet cells. Thus, 
we speculate that stem cells, using forward regulatory mechanisms, 
may orchestrate many tissue-wide changes, rather than merely acting 
as a source of new cells. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 

Animals. FOXJ1-creER"', CK5-rtTA*', Scgblal-creER", tet(O)cre (JAX 006224), 
CK5-creER”, tet(O)DTA™, Rosa26R-DTA (JAX 009669), RBPjk”"4, LSL-YFP 
(JAX 006148), Mindbomb#”"’, Notch2™" (JAX 010525), Jag2™"* and C57BL6/ 
J JAX 000664) mice were previously described. Progeny of Scgblal-creER and 
LSL-YFP crosses as well as CK5-rtTA and tet(O)DTA crosses were subsequently 
mated to generate Scgblal-creER;LSL-YFP;CK5-rtTA;tet(O)DTA mice’*. These 
mice were treated with tamoxifen and then with inhaled PBS (control) or inhaled 
doxycycline as previously described’*. Scgblal-creER mice were crossed with 
RBPjk"" mice to generate secretory-progenitor-specific Segb1a1-creER;RBPjk"" 
conditional knockout mice. To allow for lineage tracing, these mice were 
crossed with LSL-YFP mice to generate Scgblal -creER;LSL-YFP;RBPjk!" 
mice. Tamoxifen was administered by intraperitoneal injection (2 mg per day) 
for five consecutive days to induce the Cre-mediated recombination. 
Similarly, Scgblal -creER:LSL-YFP;Notch2™" mice were generated and treated. 
CK5-rtTA and tet(O)cre mice were crossed to generate CK5-rtTA;tet(O)cre 
mice. CK5-rtTA;tet(O)cre mice were crossed with Mindbomb! mice to generate 
basal-stem-cell-specific CK5-rtTA;tet( O)Cre;Mindbombe conditional knockout 
mice. Doxycycline administration was performed through drinking water (1 mg 
per ml) for 2weeks as described previously*!**. CK5-creER;LSL-YFP,Jag2"" 
mice were generated and treated, in this case with 2 doses of tamoxifen, due 
to a higher sensitivity of this strain to the compound. Mice were euthanized 
10 days after the last tamoxifen injection. Male 6-12-week-old mice were used 
for experiments except in specific circumstances in which breeding limitations led 
to the use of females in the following strains: Scgblal-creER;LSL-YFP;CK5- 
rtTA;tet(O)DTA and CK5-rtTA;tet(O)Cre;Mindbombi™ mice. Similarly aged 
mice were used for both control and treated animals. Controls include corn 
oil-treated FOXJ1-creER; tet(O)DTA mice, i-PBS treated Tam-induced Scgb1a1- 
creER;LSL-YFP;CK5-rtTA;tet(O)DTA mice, Tam-treated Scgblal-creER;LSL- 
YFP;RBPjk"”* mice, Tam-treated Scgb1a1-creER;LSL-YFP;RBPjk*’* mice, Tam- 
treated Scgbl al-creER;LSL-YFP;Notch2*’* mice, doxycycline-treated CK5- 
rtTA;tet(O)Cre;Mib1*’* mice and Tam-treated CK5-creER;LSL-YFP;Jag2‘/ 7 
mice. BrdU (5 mg) 2h before euthanasia in all cases. Additionally, we treated 
mice with 1 mg ml’ of BrdU in drinking water from the time of the last tamox- 
ifen injection to euthanisia to analyse proliferative events occurring as a con- 
sequence of genetic modulation. We analysed at least 3-7 mice per condition in 
each experiment and all the experiments were repeated at least three times with 
the exception of CK5-rtTA;tet( O)Cre;Mindbomb i" “fl and the cell ablation experi- 
ments that were repeated twice. All procedures and protocols were approved by 
the MGH Subcommittee on Research Animal Care in accordance with NIH 
guidelines. 

Tissue preparation, immunohistochemistry and immunofluorescence. Mouse 
trachea were removed using sterile technique and then fixed in 4% paraformalde- 
hyde for 2h at 4°C, washed with PBS, and transferred to a 30% sucrose solution 
overnight. For immunofluorescence, airways were embedded in OCT and 
cryosectioned as transverse 7-|1m sections. Cryosections were stained with the 
previously described protocol'*?!***’. The following antibodies were used: rabbit 
anti-caspase3, cleaved (1:100, 9661, Cell Signaling); rabbit anti-cytokeratin 5 
(1:1000; ab53121, Abcam); mouse anti-FOXJ1 (1:500; 14-9965, eBioscience); 
chicken anti-green fluorescent protein (1:500; GFP-1020, Aves Labs); goat anti- 
GFP (1:100; NB-100-1770, Novus Biologicals); anti-Ki67 (1:200; ab15580, 
Abcam); rat anti-RBPjk (1:100; SIM-2ZRBP2, Cosmobio); goat anti-SCGB1A1 
(1:500; provided by B. Stripp); goat anti-CC10 (1:100; sc-9772, Santa Cruz 
Biotechnology), rabbit anti-SCGB3A2 (1:100; provided by Shioko Kimura); 
mouse anti-p63 (1:100; sc-56188, Santa Cruz Biotechnology); mouse IgM anti- 
SSEA-1 (1:100; 14-8813-82, eBioscience), mouse anti-tubulin, acetylated (1:100; 
T6793, Sigma), rabbit anti-alpha smooth muscle actin (1:100; ab5694, Abcam), 
rat anti-CD45 (1:100; 14-0451, eBioscience) and rat anti-CD31 (1:100; 553370, 
BD Pharmingen). BrdU incorporation was detected using Amersham Cell 
Proliferation Kit (RPN20, GE Healthcare, Waukesha, WI). Cell death was detected 
using DeadEnd Fluorometric TUNEL System (G3250, Promega, Madison, WI). 
Appropriate secondary antibodies (Life Technologies’ Alexa Fluor series 488, 594 
or 647) were diluted 1:500. In the case of rabbit anti-Notch2 (1:2000; D67C8, Cell 
Signaling), rabbit anti-activated Notch1 (1:1500, ab8925, Abcam), rabbit anti- 
Notch3 (1:1500, sc-5593, Santa Cruz Biotechnologies), rabbit anti-c-MYB 
(1:3000; sc-519, Santa Cruz Biotechnology) and rabbit anti-Mindbomb1 (1:500, 
M6073, Sigma), following primary antibody incubation, sections were washed and 
incubated with anti-Rabbit-HRP conjugate (1:1,000; 170-6514, Bio-Rad) for 1 h at 
room temperature followed by tyramide signal amplification. Sections were then 
washed an incubated for 30min at room temperature with streptavidin-594 
(1:1,000; S-11227, Life Technologies)”. For more information on the protocol 


to detect low levels of c-MYB and N2ICD using tyramide signalling amplification, 
please refer to the Rajagopal laboratory website (http://www.massgeneral.org/ 
regenmed/staff/Rajagopallab). 

Microscopy and imaging. Tissue was imaged using an Olympus FluoView FV 10i 
confocal microscope (Olympus Corporation). Cells were manually counted based 
on immunofluorescence staining of markers for each of the respective cell 
types*’*’. Briefly, cell counting was performed on the basis of nuclear staining 
with DAPI (nuclei) and specific cell markers. Cells were counted using X40 
magnification fields (each field represented 250 j1m of epithelium) covering the 
whole tracheal epithelium, from cartilage ring 1 to 10, of each mouse. This includes 
approximately 1,300 to 1,800 DAPI* cells per experiment. In CK5-creER;LSL- 
YFP;Jag2"" mice, given the low (approximately 10%) rate of genetic recombina- 
tion, we showed images in regions where there were patches of YFP” basal cells 
that had undergone recombination, and therefore Jag2 deletion. Of note, cell 
counts were performed manually throughout the entire tracheal epithelium, and 
were not restricted to areas of basal cell recombination even in these mice. Images 
were processed and analysed using ImageJ/Fiji (NIH) and Adobe Photoshop 
Creative Suite 5 (Adobe). 

Cell dissociation, FACS and flow cytometry analysis. Airway epithelial cells 
from trachea were dissociated using papain solution as previously described’’. 
Briefly, following trachea removal, airway tissue was cut into small fragments 
and transferred to a 2 ml solution containing 1 ml 100 U of pre-activated papain 
(Worthington Biochemical Corporation, catalogue number LK003182) and 1 ml 
of activation buffer as per the manufacturer’s protocol. Tissue fragments were 
incubated on a shaking platform for 90 min at 37°C. The cell suspension was 
passed through a 70 1m cell strainer to remove airway husks and pelleted for 5 min 
at 400g. The supernatant was aspirated and the pellet was resuspended in ovomu- 
coid solution (Worthington Biochemical Corporation, catalogue number 
LK003182) for 20 min at 4°C to inactivate residual papain activity. Dissociated 
cells were stained with the following antibodies: Ep>CAM-PECy7 (1:50; 25-5791- 
80, eBiosciences) or EyCAM-APC (1:50; 17-5791, eBiosciences); GSIB4 (Griffonia 
Simplicifolia Isolectin beta4)-Biotin (L2120, Sigma); SSEA-1 eFluor 650NC (1:75, 
95-8813-41, eBiosciences); CD24-PE (1:100, 553262, BD Pharmingen). Primary 
antibodies were incubated for 30 min in 2.5% FBS in PBS on ice. FACS and flow 
cytometry was performed on a BD FACSAria II sorter at the CRM Flow Cytometry 
Core (Boston, MA). All aforementioned cell sortings were previously gated for 
EpCAM to exclusively select epithelial cells. Of note, differences in the percentage 
of each airway epithelial cell type analysed by flow cytometry might differ from the 
quantitation performed by cell counting. This reflects the use of cell surface mar- 
kers for flow analysis (CD24 for ciliated cells) in contrast to cell counts based on 
the nuclear transcription factors (such as FoxJ1 and c-MYB for ciliated cells). 
Additionally, flow cytometry involves enzymatic tracheal dissociation and cells 
may die in this process and some cell types might demonstrate differential viability 
following enzymatic dissociation. Sorted cells were lysed immediately in TRI 
Reagent (Sigma) and RNA was extracted as previously described’’. Data were 
analysed on FlowJo Software (version 10). 

RNA extraction and quantitative RT-PCR. Total RNA was extracted from 
sorted airway epithelial cells from individual mice to analyse gene expression 
by quantitative RT-PCR. These procedures were performed as previously 
described*’. Relative mRNA expression was normalized to baseline transcript 
levels in secretory progenitor cells in Figs 2a and 4a, and in control YFP* cells 
in Fig. 3d, h. In addition, the primer sequences for the following genes were used: 
Notch1: forward 5'-tgagactgccaaagtgttgc-3' and reverse 5'-gtgggagacagagtggstgt- 
3'; Notch2: forward 5'-cctgaacgggcagtacattt-3’ and reverse 5’'-gcgtagcccttcaga 
cactc-3'; Notch3: forward 5'-tgagtgtccagctggctatg-3’ and reverse 5'-cacaggtgcc 
attgtgtagg-3’; Dill: forward 5'-ttagcatcattggggctacc-3’ and reverse 5'-taagtgtt 
gegecgatcttc-3’; Jag]: forward 5’-cagtgcctctgtgagaccaa-3’ and reverse 5'-agggetc 
agagagacaagca-3'; Jag2: forward 5’-cagatccgagtacgctgtga-3’ and reverse 5’-ggct 
tctttgcattctttgc-3'; Hes1: forward 5’-ctaccccagccagtgtcaac-3’ and reverse 5’-atgcc 
gegagctatctttct-3’; Heyl: forward 5'-gagaccatcgagetggaaaa-3’ and reverse 5’-agcag 
atccctgcttctcaa-3’; HeyL: forward 5'-cccccttaccctatctcagc-3’ and reverse 5’-acat 
getggeattggegacta-3'; RBPjk exons 6-7: forward 5'-ggcagtggttggaagaaaaa-3' and 
reverse 5'-atgtcatcgctgttgccata-3'; Notch2 exon3: forward 5'-aacatcgagacc 
cctgtgag-3' and reverse 5’ -ggctgagcatgtgacaggta-3’; Jag2 exon2: Forward 5’-cgtgtg 
ccttaaggagtacca-3' and reverse 5'-gcgaactgaaagggaatgac-3'; Scgb3a2: forward 
5' gacaggactgaagaagtgtetge-3’ and reverse 5’-ggagettgttcacgtagcaaagg-3'; c-myb: 
forward 5'-gctgaagaagctgetggaac-3’ and reverse 5’-caacgcttcggaccatattt-3’. 

Cell culture and viral transduction. Mouse tracheal epithelial cells were disso- 
ciated with papain and sorted with EpCAM and GSIP4 as previously described’. 
Cells were cultured and expanded in complete SAGM (small airway epithelial cell 
growth medium; Lonza, CC-3118) using 5mM Rock inhibitor Y-27632 
(Selleckbio, $1049). To initiate air-liquid interface (ALI) cultures, airway basal 
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cells were dissociated and seeded onto transwell membranes. After confluence, 
media was removed from the upper chamber. Mucocilary differentiation was 
performed with PneumaCult-ALI Medium (StemCell, 05001). Differentiation of 
airway basal cells on ALI was followed by directly visualizing beating cilia in real 
time after 8 days. One day after plating, mouse basal cells were infected with 
lentiviral vectors carrying shRNAs targeting mouse Jag2. Four different clones 
were obtained from Sigma (MISSION shRNA jagged2 NM_010588, clones 
TRCN0000028858, TRCN0000028871, TRCN0000028877, TRCN0000028906), 
and cloned into pLKO.1 vector (Addgene Plasmid 10878). Lentiviral production 
was performed in HEK293 cells following standard protocols. Concentrated 
viruses were used at a MOI of 6 to infect murine basal cells for 9h at 37°C in 
5% COz, one day after plating. The cells were allowed to grow to confluence before 
being transferred onto transwell membranes. Then 23 days after ALI initiation, 
cells were washed, collected and sorted for GFP and cell specific markers. To assess 
the efficiency of shRNA Jag2 knockdown, non-purified infected cells were col- 
lected 72 h after infection and lysed in TRI Reagent. 

Statistical analysis. The standard error of the mean was calculated from the 
average of the indicated number of samples in each case ( = biological repli- 
cates/condition/experiment). All the experiments were repeated at least three 
times with the exception of CK5-rtTA;tet( O)Cre;Mindbomb and the cell abla- 
tion experiments that were repeated twice. Data was compared among groups 
using the Student's t-test (unpaired, two-tailed test). A P value of less than 0.05 was 
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considered significant. The analysis was performed with Prism software 
(Graphpad Prism version 5.0a). 

Data reporting. No statistical methods were used to predetermine sample size. 
Experiments were performed completely blinded, being repeated by two different 
investigators and some of the experiments were repeated without knowing which 
samples were analysed. 
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Extended Data Figure 1 | Parent stem/progenitor cells can serve as niches _ciliated cells. c, A schematic of the traditional arrangement of a stem cell that is 
for their own daughter cells. a, Schematic representation of the airway maintained ina stem cell niche (left) and a schematic that further illustrates that 
epithelial cell lineage. Basal stem/progenitor cells give rise to secretory stem cells can themselves serve as daughter cell niches, in which the parent stem 
progenitor cells that, in turn, give rise to terminally differentiated ciliated cells. cell itself is required for the maintenance of its own progeny (right). 

b, Basal cells expressing Notch ligands provide a tonic forward Notch signal to —_d, Schematic of the types of signals that occur between cells within a lineage and 
neighbouring secretory daughter cells. Blocking this forward signal prevents the theoretical modes of cell regulation that they imply. Blue arrows indicate a 
Notch activation in secretory cells and results in their differentiation into lineage relationship. Red arrows represent signals. 


©2015 Macmillan Publishers Limited. All rights reserved 


Control 


Day5 


Day 150 | Day 60 Day 45 Day 30 ]| Day 15 
3 ? pe 
¢ 
i > 


oO 


Ki67* cells per DAPI* cells (%) 


FOXJ1-DTA (d3) 
Tam __][_Control_] 


FOXJ1-DTA (43 
[Control _] 
oe 


Tam 
BrdUt cells per DAPI* cells (%) 


pS: 2 V3 Gigs 4 
S00. 508 BES ; 


a 
& 
fs 
9 
= 
x 
(e) 
re 


Extended Data Figure 2 | Ablation of ciliated cells has no effect on airway 
cell proliferation, mesenchymal cell types, mesenchymal morphology and 
airway stem and progenitor cell replication over time. a, Immunostaining for 
basal (CK5 (green)), ciliated (FOXJ1 (red)), and secretory cells (SCGB1A1 
(white)) on either control (left panels) or tamoxifen (Tam)-treated FOXJ1- 
DTA mice (right panels) 5, 15, 30, 45, 60 and 150 days after ciliated cell ablation 
(n =3 mice). b, Quantification of absolute cell numbers of basal CK5* cells 
(top graph) and secretory SCGB1A1" cells (bottom graph) per trachea on 
control (black bars) or Tam-treated (white bars) mice over time (nm = 3 mice). 
c, Immunostaining for ciliated cells (FOXJ1 (green)) and proliferating cells 
(Ki67 (red)) on either control (upper panel) or tamoxifen (Tam)-treated 
FOXJ1-DTA mice (lower panel) (n = 6 mice). On the right, quantification of 
the percentage of Ki67~ cells per total DAPI™ cells in tracheal sections from 
control (C) or Tam-treated mice 3 days after cell ablation (n = 3 mice). 
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d, Quantification of the percentage of proliferating Ki67* cells relative to total 
DAPI‘ cells in control (black bars) or Tam-treated (white bars) mice over 
time (n = 3 mice). e, Immunostaining for ciliated cells (AcTub (green)) and 
cells that have undergone proliferation (BrdU (red)) on either control (upper 
panel) or Tam-treated mice (lower panel) at day 3 (n = 6 mice). On the 
right, quantification of the percentage of BrdU" cells per total DAPI” cells in 
tracheal sections from control (C) or Tam treated mice (n = 3 mice). 
f, Haematoxylin & eosin (H&E) staining of tracheal sections 3 days after ciliated 
cell ablation. g, Immunostaining for CD45* haematopoietic cells (left panels), 
CD31* endothelial cells (middle panels) and SMA* smooth muscle cells 
(right panels) (green) three days after cell ablation (n = 6 mice). Nuclei stained 
with DAPI (blue). The ns indicates that the cell number comparisons are not 
statistically significant. n = biological replicates/condition (two independent 
experiments). Data shown in the graphs are means + s.e.m. Scale bar, 20 um. 
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Extended Data Figure 3 | Basal stem/progenitor cell ablation promotes the 
differentiation of secretory cells into ciliated cells without affecting the 
mesenchyme. a, Immunostaining for YFP lineage label (green) and the ciliated 
cell marker c-MYB (red) in SCGB1A1-YFP; CK5-DTA mice (” = 3 mice). 
White arrowheads point to double positive cells. b, Immunostaining for YFP 
lineage label (green) and the ciliated cell marker AcTub (red) using SCGB1A1- 
YFP; CK5-DTA mice (n = 3 mice). c, Flow cytometry analysis for lineage- 
labelled YFP* cells (x axis) and CD24° ciliated cells (y axis) cells from control 
iPBS-treated or doxycycline-treated SCG1A1-YFP; CK5-DTA mice. 

d, Quantification of the percentage of FOXJ1* cells per total DAPI™ cells 


in tracheal sections from control iPBS-treated or iDOX-treated SCG1A1-YFP; 
CK5-DTA mice (” = 3 mice). On the right, absolute numbers of FOXJ1* 
cells per tracheal section (n = 3 mice). e, H&E staining of tracheal sections 
following basal cell ablation. f, Immunostaining for CD45" haematopoietic 
cells (left panels), CD31* endothelial cells (middle panels) and SMA* smooth 
muscle cells (right panels) (green) in control or basal cell-ablated trachea (n = 3 
mice). All analyses were performed 3 days after cell ablation. Nuclei stained 
with DAPI (blue). n = biological replicates/condition (two independent 
experiments). **P < 0.01. Data shown in the graphs are means ~ s.e.m. 

Scale bar, 20 pm. 
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Extended Data Figure 4 | Characterization of Notch pathway components 
in the steady-state murine tracheal epithelium. a, Relative mRNA expression 
of Notch1, Notch3 and Notch4 assessed by qRT-PCR in pure sorted populations 
of airway epithelial cells (n = 3 mice). Relative expression is normalized to 
baseline transcript levels in secretory progenitor cells. b, Immunostaining for 
NIICD (red) in combination with the basal cell marker p63 (top panel), the 
secretory cell marker SSEA-1 (middle panel) and the ciliated cell marker FOXJ1 
(bottom panel) (green). c, Immunostaining for N3ICD (red) in combination 
with the basal cell marker podoplanin (PDPN) (top panel), the secretory 
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cell marker SSEA-1 (middle panel) and the ciliated cell marker FOXJ1 
(bottom panel) (green). d, Relative mRNA expression of Hes1, Heyl and HeyL 
assessed by qRT-PCR in pure sorted populations of airway epithelial cells 

(n = 3 mice). Relative expression is normalized to baseline transcript levels 
in secretory progenitor cells. n = biological replicates/condition. **P < 0.01; 
***P < 0.001. nd indicates lack of detection. Data shown in the graphs are 
means + s.e.m. Nuclei stained with DAPI (blue). White arrowheads point to 
double positive cells. Scale bar, 20 um. 
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Extended Data Figure 5 | Downregulation of Notch signalling transduction 
following RBPjk deletion in secretory progenitor cells induces their 
conversion into ciliated cells. a, Immunostaining for lineage-labelled YFP* 
cells (green) in combination with RBPjk (red) in Tam-treated SCGB1A1- 
RBPjk'’* control mice (upper panels) and Tam-treated SCGB1A1-RBPjk™/” 
mice (lower panels). White arrowheads point to lineage-labelled RBPjk cells. 
The yellow arrows point to lineage-labelled cells that have not undergone 
recombination. b, Quantification of the percentage of RBPjk* cells per total 
YFP* cells at experimental day 15 following tamoxifen administration to 
SCGB1A1-RBPjk’’* control (black bar) and SCGB1A1-RBPjk” ® mice (white 
bar) (n = 6 mice). c, Relative mRNA expression of Notch signalling component 
genes (RBPjk, Hes1, HeyL) analysed by qRT-PCR in sorted YFP” cells from 
Tam-treated SCGB1A1- RBPjk*/* control mice (black bars) (n = 3 mice) and 
Tam-treated SCGB1A1 -RBPjK" mice (white bars) (n = 4 mice). Relative 
expression is normalized to baseline transcript levels in YFP* control cells. 

d, Immunostaining for YFP lineage label (green) and the secretory progenitor 
cell markers SCGB3A2 (left panels) and SSEA-1 (right panels) (red) in Tam- 
treated SCGB1A1-RBPjk V+ mice (control) (top panels) and SCGB1A1- 
RBPjk" mice (bottom panels). e, Immunostaining for YFP lineage label 
(green) and the ciliated cell markers AcTub (left panels) and c-MYB (right 
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panels) (red) in Tam-treated SCGB1A1- RBPjk’* mice (control) (top panels) 
and SCGB1A1- RBPjKM” mice (bottom panels). White arrowheads point to 
lineage-labelled secretory cells that differentiated into ciliated cells following 
RBPjk deletion. f, Immunostaining for lineage-labelled YFP™ cells (green) and 
the basal cell marker CK5 (red) on either Tam-treated SCGB1A1- RBPjkY - 
control mice (upper panel) or Tam-treated SCGB1A1-RBPjk™" mice (lower 
panel). g, Quantification of the percentage of CK5* cells per total YEP™ cells 
in Tam-treated SCGB1A1- RBPji" mice compared to control mice. h, Flow 
cytometry analysis 0 of EPCAM™ YEP* CD24* lineage-labelled ciliated cells 
and EpCAM* YFP*CD24~ SSEA-1* lineage-labelled secretory cells or 
EpCAM”* YFP* CD24 GSIB4* lineage-labelled basal cells in airways from 
either control or Tam-treated SCGB1A1-RBPjk™" mice. i, Quantification of the 
percentage of epithelial (EpCAM*) lineage-labelled (YEP) basal, secretory 
and ciliated cells in either Tam-treated SCGB1A1-RBPjk*/* control or 
SCGB1A1-RBPjk™" mice by flow cytometry (n = 3 mice). The analysis was 
performed 10 days after the last tamoxifen injection. Images are representative 
of n = 6 mice per condition (biological replicates) repeated three times. 
Nuclei stained with DAPI (blue). **P < 0.01; ***P < 0.001. Data shown in 
the graphs are means + s.e.m. Scale bar, 20 um. 
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Extended Data Figure 6 | Lineage-labelled ciliated cells demonstrate long 
term persistence after RBPjk deletion without a change in epithelial cell 
proliferation and apoptosis. a-d, Immunostaining for the lineage label 
YFP (green) in combination with the secretory cell markers SCGB1A1 

(a), SCGB3A2 (b) or the ciliated cell markers FOXJ1 (c) and AcTub (d) (red) 
on either Tam-treated SCGB1A1-RBPjk”* control mice (upper panels) or 
Tam-treated SCGB1A1-RBPjk”" mice (lower panels) 30 days after the last 
tamoxifen injection (n = 3 mice). White arrowheads point to lineage-labelled 
ciliated cells. e, Quantification of the percentage of each cell type per YFP™ cells 
on either control mice (black bars) or Tam-treated SCGB1A1-RBPjk”" mice 
(white bars) at day 30. f, Quantification of the percentage of ciliated FOXJ1~ 
cells that incorporate BrdU after continuous BrdU administration to Tam- 
treated SCGB1A1-RBPjk™" mice (n = 3 mice). g, Immunostaining for Ki67 
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(red) to assess overall proliferation in either Tam-treated SCGB 1A1-RBPjk” * 
control mice (upper panel) or Tam-treated SCGB1A1-RBPjk™" mice (lower 
panel) (n = 3 mice). h, i, Immunostaining for FOXJ1 (green) and BrdU (red) 
in combination with YFP (cyan) (h) or alone (i) on Tam-treated SCGB1A1- 
RBPjk”" mice that received continuous BrdU (n = 3 mice). j, Immunostaining 
to detect apoptotic cells by TUNEL assay (red) in combination with YFP 
lineage-labelled cells (green) in either Tam-treated SCGBIA1-RBPjk"”* 
control mice (upper panel) or Tam-treated SCGB1A1-RBPjk™”" mice 

(lower panel) (n = 3 mice). k, Immunostaining for activated caspase3 (green) 
in control and Tam-treated SCGB1A1-RBPjk Vl mice (n = 3 mice). 

f-k, Analysis conducted 10 days after induction. Nuclei stained with DAPI 
(blue). n = biological replicates per condition. ***P < 0.001. Data shown in 
the graph are means + s.e.m. Scale bar, 20 jum. 
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Extended Data Figure 7 | Efficient deletion of Notch2 in secretory 
progenitor cells and its effect on cell type distribution. a, Relative mRNA 
expression of Notch2 in YFP* cells from Tam-treated SCGB1A1-Notch2*/* 
control mice and Tam-treated SCGB1A1-Notch2™" experimental mice 
assessed by qRT-PCR (n = 3 mice). b, Relative mRNA expression of the Notch 
target genes (Hes1, HeyL) in YFP* cells from control mice and Tam-treated 
SCGB1A1-Notch2™" experimental mice (n = 3 mice). Relative expression is 
normalized to baseline transcript levels in lineage-labelled YFP* control cells. 
c, Immunostaining for lineage label YFP (green) in combination with N2ICD 
(red) on control mice (Tam-treated SCGB1A1-Notch2*!*) and experimental 
airways (Tam-treated SCGB1A1-Notch2 f/f) White arrowheads point to 
lineage-labelled cells that had lost Notch2 and therefore do not show N2ICD 
expression. d, Quantification of the percentage of N2ICD* cells per total YEP* 
cells in Tam-treated SCGB1A1-Notch2" mice compared to control (n = 7 
mice). e, Immunostaining for YFP lineage label (green) and the secretory 
progenitor cell markers SCGB3A2 (left panels) and SSEA-1 (right panels) (red) 
in control (top panels) and experimental (bottom panels) mice. 

f, Immunostaining for YFP lineage label (green) and the ciliated cell markers 
AcTub (left panels) and c-MYB (right panels) (red) in control (top panels) and 
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experimental (bottom panels) mice. White arrowheads point to lineage- 
labelled secretory cells that differentiated into ciliated cells following Notch2 
deletion. Yellow arrows point to actual cilia (green) in lineage-labelled cells. 
g, Flow cytometry analysis of EPCAM* YEP* CD24" lineage-labelled ciliated 
cells and EpCAM* YFP'CD24 SSEA-1* lineage-labelled secretory cells or 
EpCAM” YFP* CD24 GSIB4* lineage-labelled basal cells in airways from 
either Tam-treated SCGB1A1-Notch2*’* control mice or Tam-treated 
SCGB1A1-Notch2”" mice. h, Quantification of the percentage of epithelial 
(EpCAM*) lineage-labelled (YFP *) basal, secretory and ciliated cells in either 
Tam-treated SCGB1A1-Notch2*!* control (n = 4 mice) or SCGB1A1- 
Notch2"" mice (n = 6 mice) by flow cytometry. i, Immunostaining for the 
basal cell transcription factor p63 (red) on control or SCGB1A1-Notch2”" 
airways. j, Quantification of the percentage of p63” cells per total DAPI™ cells 
on tracheal sections from control or experimental mice (n = 7 mice). Analysis 
performed 10 days after induction. Images are representative of n = 7 mice 
per condition (biological replicates) repeated three times (three independent 
experiments). Nuclei stained with DAPI (blue). *P < 0.05; ***P < 0.001. Data 
shown in the graphs are means + s.e.m. Scale bar, 20 jim. 
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Extended Data Figure 8 | Proliferation and apoptosis following deletion 
of Notch2 in secretory progenitor cells. a, Immunostaining for lineage 
label YFP (green), FOXJ1 (cyan) and N2ICD (red) in Tam-treated 
SCGB1A1-Notch2™" mice. White arrowhead points to a lineage-labelled cell 
co-expressing markers for secretory and ciliated cell fates. The inset shows the 
single stain for FOXJ1 of the indicated region. b, Immunostaining for 

lineage label YFP (green), FOXJ1 (cyan) and SSEA-1 (red) in Tam-treated 
SCGB1A1-Notch2™" mice. White arrowhead points to a lineage-labelled 
transitional cell. c, Immunostaining for BrdU (green), p63 (red) and Ki67 
(cyan) to assess overall proliferation on either Tam-treated SCGB1A1- 
Notch2*’* control mice (upper panels) or Tam-treated SCGB1A1-Notch2™”” 
mice (lower panels). d, Quantification of the percentage of ciliated FOXJ1 * cells 
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that incorporate BrdU after continuous BrdU administration to Tam-treated 
SCGB1A1-Notch2™" mice (n = 4 mice). e, Immunostaining for FOXJ1 (green) 
and BrdU (red) on Tam-treated SCGB1A1-Notch2"" mice that received 
continuous BrdU (n = 4 mice). f, Immunostaining to detect apoptotic cells by 
TUNEL assay (green) on either Tam-treated SCGB1A1-Notch2*’~ control 
mice (upper panel) or Tam-treated SCGB1A1-Notch2"" mice (lower panel). 
g, Immunostaining for YFP (green) in combination with activated caspase3 
(red) on control mice (upper panel) or Tam-treated SCGB1A1-Notch2™" 
mice (lower panel). Analysis performed 10 days after induction. Images are 
representative of n = 7 mice per condition (biological replicates) repeated three 


times (three independent experiments). Nuclei stained with DAPI (blue). 
Scale bar, 20 um. 
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Extended Data Figure 9 | Loss of Notch ligands in basal stem cells promotes 
secretory cell differentiation into ciliated cells without affecting 
proliferation or apoptosis. a, Quantification of the percentage of basal 
PDPN* cells that express Mib1 (left graph) on either Dox-treated CK5- 
Mib1*’~ control mice or Dox-treated CK5-Mib1"" mice (n = 4 mice). Right 
graph, percentage of basal cells in which Mib1 was deleted in Dox-treated CK5- 
Mib1"" mice (n = 4 mice). b, Immunostaining for Mib1 (red) and the basal cell 
marker CK5 (green). White arrowheads point to Mib1~ basal cells. 

c, Immunostaining for the secretory cell markers SCGB3A2 (left panels) and 
SSEA-1 (right panels) (red) in control (top panels) and experimental (bottom 
panels) mice. d, Immunostaining for the ciliated cell markers AcTub (left 
panels) and c-MYB (right panels) (green) in control (top panels) and 
experimental (bottom panels) mice. e, Flow cytometry analysis of EpPCAM* 
CD24" ciliated cells and Ep>CAM* SSEA-1" secretory cells from control and 
experimental mice. f, Percentage of epithelial (EpCAM *) basal, secretory and 
ciliated cells on both groups by flow cytometry (n = 3 mice). 

g, Immunostaining for Ki67 (green) and the secretory cell marker SCGB1A1 
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(red) on control (top panel) or Dox-treated CK5-Mib1™" mice (bottom panel). 


h, Immunostaining for BrdU (green) in combination with the basal cell 
transcription factor p63 (red) on both groups. i, Immunostaining for FOXJ1 
(green) and BrdU (red) on Dox-treated CK5-Mib1" mice that received 
continuous BrdU. j, Percentage of ciliated FOXJ1 * cells that incorporate BrdU 
after continuous BrdU administration to Dox-treated CK5-Mib1"" mice 

(n = 4 mice). k, Immunostaining to detect apoptotic cells by TUNEL assay 
(green) on either control (upper panel) or experimental mice (lower panel). 
1, Immunostaining for activated caspase3 (green) on both groups). 

m, Immunostaining for N2ICD (red), SCGB1A1 and SCGB3A2 (red), or 
FOXJ1 and AcTub (green) in control (top panels) or experimental mice 
(bottom panels) after five weeks of continuous doxycycline treatment (n = 4 
mice). a-l, Analysis performed 2 weeks after the beginning of Dox induction. 
Images are representative of n = 4 mice per condition (biological replicates) 
repeated twice. *P < 0.05; ***P < 0.001. Data shown in the graphs are 
means + s.e.m. Nuclei, DAPI (blue). Scale bar, 20 um. 
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Extended Data Figure 10 | Disruption of Jag2 in basal stem/progenitor cells 
causes the differentiation of secretory progenitor cells into ciliated cells 
without affecting proliferation or apoptosis. a, Schematic representation of 
Jag2 inhibition using lentiviruses (LV) carrying shRNAs. Infected GFP* cells 
were cultured in ALI culture system for 23 days, when they were collected, 
sorted and analysed. b, Relative mRNA expression of Jag2 in tracheal epithelial 
cells infected with mock vector (control) or with vectors carrying 4 different 
shRNAs targeting Jag2 72 h after infection. c, Relative mRNA expression of Jag2 
in tracheal epithelial basal cells infected with mock vector (control) or with 
lentivirus targeting Jag2 (shJag2 877) after 23 days in ALI. d, Relative mRNA 
expression of the secretory genes (Scgblal and Scgb3a2) and the ciliated cell 
genes (FoxJ1 and c-myb) in mock (black bars) and shJag2 877 (grey bars) 
infected cells 23 days after ALI initiation. Relative expression is normalized to 
baseline transcript levels in mock-infected cells. e, Relative mRNA expression 
of Jag2 on sorted recombined (YEP*) basal cells and unrecombined YFP basal 
cells from Tam-treated CK5-Jag2"/@ mice (n = 3 mice). Relative expression is 
normalized to baseline transcript levels in YFP cells. f, Percentage of YEP* 
cells per total DAPI” cells (efficiency of recombination) on either Tam-treated 
CK5-Jag2*/ * control (black bars) or Tam-treated CK5-Jag2”" (white bars) 
mice assessed by manual counting (left graph) (n = 5 mice) or by flow 
cytometry (right graph) (n = 3 mice). g, Immunostaining for SCGB3A2 (left 
panels) and SSEA-1 (right panels) (red) in combination with YFP (green) in 
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control (top panels) and experimental (bottom panels) mice. 

h, Immunostaining for AcTub (left panels) and c-MYB (right panels) (red) in 
combination with YFP (green) in control (top panels) and experimental 
(bottom panels) mice. i, Flow cytometry analysis of EPCAM* CD24" ciliated 
cells and EpPCAM*SSEA-1* secretory cells in control and experimental mice. 
j, Percentage of epithelial (EpCAM”) basal, secretory and ciliated cells from 
both groups assessed by flow cytometry (n = 3 mice). k, Immunostaining for 
p63 (red) on control (top panel) and experimental mice (bottom panel). 

1, Percentage of p63" cells per total DAPI™ cells on both groups. m, Immuno- 
staining for FOXJ1 (green), N2ICD (red) and SCGB1A1 (cyan). n, Immuno- 
staining for BrdU (green), p63 (red) and Ki67 (cyan) in either control 

(upper panels) or experimental mice (lower panels). 0, Percentage of ciliated 
FOXJ1™ cells that incorporate BrdU after continuous administration of BrdU 
to Tam-treated CK5-Jag2”™ fl mice (n = 3 mice). p, Immunostaining for 
FOXJ1 (green) and BrdU (red) on Tam-treated CK5-Jag2" mice that received 
continuous BrdU (n = 3 mice). q, Immunostaining to detect apoptotic cells by 
TUNEL assay (green) on both groups. r, Immunostaining for YFP (green) 

in combination with activated caspase3 (red) on control (upper panel) or 
experimental mice (lower panel). f-r, Analysis performed 10 days after induc- 
tion. Images are representative of n = 5 mice per condition (biological 
replicates) repeated three times. *P < 0.05; **P < 0.01; ***P < 0.001. Data 
shown in the graphs are means + s.e.m. Nuclei, DAPI (blue). Scale bar, 20 um. 
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Expression of barley SUSIBA2 transcription factor 
yields high-starch low-methane rice 


J. Sub?*, Cc. Hub?*, X. Yan?*, Y. Jin??, Z. Chen!, Q. Guan!, Y. Wang’, D. Zhong’, C. Jansson’, F. Wang’, A. Schniirer® & C. Sun? 


Atmospheric methane is the second most important greenhouse 
gas after carbon dioxide, and is responsible for about 20% of the 
global warming effect since pre-industrial times’. Rice paddies are 
the largest anthropogenic methane source and produce 7-17% of 
atmospheric methane*’. Warm waterlogged soil and exuded nutri- 
ents from rice roots provide ideal conditions for methanogenesis in 
paddies with annual methane emissions of 25-100-million 
tonnes**. This scenario will be exacerbated by an expansion in rice 
cultivation needed to meet the escalating demand for food in the 
coming decades*. There is an urgent need to establish sustainable 
technologies for increasing rice production while reducing meth- 
ane fluxes from rice paddies. However, ongoing efforts for meth- 
ane mitigation in rice paddies are mainly based on farming 
practices and measures that are difficult to implement’. Despite 
proposed strategies to increase rice productivity and reduce meth- 
ane emissions**®, no high-starch low-methane-emission rice has 
been developed. Here we show that the addition of a single tran- 
scription factor gene, barley SUSIBA2 (refs 7, 8), conferred a shift 
of carbon flux to SUSIBA2 rice, favouring the allocation of photo- 
synthates to aboveground biomass over allocation to roots. The 
altered allocation resulted in an increased biomass and starch con- 
tent in the seeds and stems, and suppressed methanogenesis, pos- 
sibly through a reduction in root exudates. Three-year field trials in 
China demonstrated that the cultivation of SUSIBA2 rice was 
associated with a significant reduction in methane emissions and 
a decrease in rhizospheric methanogen levels. SUSIBA2 rice offers 
a sustainable means of providing increased starch content for food 
production while reducing greenhouse gas emissions from rice 
cultivation. Approaches to increase rice productivity and reduce 
methane emissions as seen in SUSIBA2 rice may be particularly 
beneficial in a future climate with rising temperatures resulting in 
increased methane emissions from paddies”. 

High-starch content and low-methane emissions are two important 
traits in rice breeding that are difficult to achieve simultaneously. In 
2002, high-yielding rice cultivars with improved productivity were 
proposed as a strategy for reducing methane emissions**. However, 
no such high-starch low-methane-emission rice has as yet been 
reported. Here we report the first example, to our knowledge, of such 
a rice, SUSIBA2 rice, generated via transcription factor technology. 

Sugar signalling in barley 2 (SUSIBA2) is a plant-specific tran- 
scription factor”*"' that regulates sugar-inducible gene expression, 
thereby mediating source-sink communication”®. High expression 
of SUSIBA2 is associated with an increase in sink strength and starch 
biosynthesis”*. We hypothesized that the overexpression of SUSIBA2 
in the seeds and stems of rice would increase sink strength in above- 
ground tissues and generate a high-starch low-methane-emission 
rice variety. 

Two stable rice lines (numbers 77 and 80) of homozygote trans- 
formants were selected in this study and defined as SUSIBA2 rice 


(SUSIBA2-77 and SUSIBA2-80, respectively). SUSIBA2-77 and its 
control, Nipponbare (Nipp), were cultivated in Fuzhou, China during 
the summers of 2012 and 2013. Results showed that cultivation of 
SUSIBA2-77 cut methane emissions to around 10% of control levels 
before flowering, and almost to zero (0.3% of the control level) at 
28 days after flowering (Fig. 1a). Genomic sequencing demonstrated 
that the observed emission trait was related to the introduced 
HvSUSIBA2 activity rather than its physical insertion site in the rice 
genome (Extended Data Fig. 1). Subsequent phytotron experiments 
corroborated the significant reduction of methane emissions in 
SUSIBA2 rice and showed that the measured methane emissions 
occurred in a linear fashion over time (Extended Data Fig. 2). 

To illustrate the role of SUSIBA2 rice in reducing methane 
emissions in different ecological environments and climates, we culti- 
vated SUSIBA2-77 and SUSIBA2-80 in the autumn of 2014 in Fuzhou, 
Guangzhou and Nanning, China at locations >500km apart, and 
measured diurnal and seasonal methane emissions (Extended Data 
Fig. 3). Both rice lines displayed similar traits in methane emissions, 
that is, less reduction in the morning, but significant reduction during 
the day (Extended Data Fig. 3). Interestingly, the observed pattern 
of emission reduction matched the predicted pattern of SUSIBA2- 
controlled sugar metabolism, which increases during the day and in 
the summer. 

To elucidate the mechanism underpinning the reduced methane 
emissions from SUSIBA2 rice, we took a three-pronged approach: (1) 
cultivating SUSIBA2 and control rice in phytotrons/fields and quan- 
tifying rhizospheric methanogen communities; (2) measuring pheno- 
typic traits of SUSIBA2 rice; and (3) characterizing genotypic traits. 

Strong blue-green autofluorescence when excited with light at 
420 nm wavelength is a characteristic of all methanogens due to the 
presence of the cofactor F420 (ref. 12). Fluorescence microscopy 
revealed that the major methanogenic autofluorescence was associated 
with the root tip and proximal regions, and much less signal was 
apparent in SUSIBA2-77 compared with Nipp (Fig. 1b). Quantifi- 
cation of the methanogenic communities indicated that the gene 
copy numbers of total archaea and methanogens and the orders 
Methanobacteriales, Methanomicrobiales and Methanocellales and 
two families Methanosaetaceae and Methanosarcinaceae of the order 
Methanosarcinales were significantly lower in SUSIBA2 rice compared 
with Nipp in both phytotron (Fig. 1c and Extended Data Fig. 4) and 
field conditions (Extended Data Fig. 5). We propose that the low- 
methane emissions from SUSIBA2 rice were due to a decrease in the 
rhizospheric abundance of methanogens. Methanocellales are the 
dominant methanogens in rice paddies'*"’, although recent reports 
indicate that other methanogenic groups can also be abundant in 
certain paddies’’. One plausible reason for the broad representation 
of methanogens detected in this study could be that plant residues were 
included as fertilisers in field and phytotron soils, possibly favouring 
richness in the methanogenic population'*’”. When we looked into the 
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Figure 1 | SUSIBA2 rice reduces methane emissions from rice paddies. 

a, Methane emissions from four rice plants (n = 4) for 2013 and three (m = 3) 
for 2012. b, Fluorescent microscopy of Nipp and SUSIBA2-77 roots. Scale bar, 
1 mm. ¢, Quantification of methanogens in soil and root samples from three 

rhizospheric positions (n = 3) of three plants (n = 3). Technical triplicates per 
position were applied. Quantification was performed for total archaea (ARC) 
and methanogens (MET), and the orders Methanobacteriales (MBT), 


specific group of Methanocella’* in the order of Methanocellales, we 
found that it was also significantly reduced in SUSIBA2 rice (Extended 
Data Fig. 5c). 

To investigate if photosynthate partitioning in SUSIBA2 rice was 
altered, we examined phenotypic traits in SUSIBA2-77 and Nipp. 
Compared with Nipp, SUSIBA2-77 has larger panicles with a higher 
proportion of filled grains, resulting in more drooping panicles 
(Fig. 2a, left and middle panels). In contrast, SUSIBA2-77 has a smaller 
root system than Nipp (Fig. 2a, right panel). Consistent with these 
observations, measurements of total dry biomass in aboveground 
and belowground tissues (Fig. 2b) showed that significantly more 
aboveground biomass and less root biomass were found with 
SUSIBA2-77. The increased aboveground biomass was associated with 
grain dry biomass and the numbers of filled grains (Fig. 2b). Plant 
height, thousand-grain weight (TGW), and numbers of panicles and 
tillers per plant were similar in SUSIBA2-77 and Nipp (Fig. 2b). 

Starch content in filled grains of SUSIBA2-77 increased to 86.9% dry 
weight compared with 76.7% in Nipp (Fig. 2c). Electron microscopy 


Methanomicrobiales (MMB) and Methanocellales and two families, 
Methanosaetaceae (Mst) and Methanosarcinaceae (Msc), of the order 
Methanosarcinales, respectively. Typical results from soil sample 1 of plant 1 
are shown. Differences between Nipp and SUSIBA2-77 were statistically 
significant (one-way ANOVA, *P =0.05 or **P = 0.01, error bars show s.d.). 
daf, days after flowering; DW, dry weight. 


examination showed that the starch granule sizes in mature seeds of 
SUSIBA2-77 were reduced (Fig. 2d). As the grain size was unaffected in 
SUSIBA2-77, we suggest that one possible reason for the increased 
starch content in mature SUSIBA2-77 grains is a more dense packing 
of smaller granules. A closer examination of starch content during seed 
development showed an elevation from 14 days after flowering (daf) 
(Fig. 2c). Notably, at the same developmental stage (14 daf), the starch 
content in stems was also significantly increased, but not in leaves and 
roots (Fig. 2c). 

To link the phenotypic traits with genotypic profiles, we examined 
the integration of HvSUSIBA2 in the SUSIBA2 rice genome, along with 
the gene and protein expression of HvSUSIBA2 and other represent- 
ative genes. Southern blot analysis revealed two copies of HvSBELIb 
p:HvSUSIBA2 in the SUSIBA2-77 and SUSIBA2-80 genome (Fig. 3a), 
in agreement with the sequencing results from SUSIBA2-77 (Extended 
Data Fig. 1b). The presence of sugar-responsive elements (SUREs) in 
the promoters of genes targeted by SUSIBA2 is critical for executing 
the sugar-signalling cascade that controls starch biosynthesis’*. Using 
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Figure 2 | Phenotypic profiling of SUSIBA2 rice and Nipponbare (Nipp). 
a, Panicles (left), aboveground plants (middle) and roots (right). b, For 
SUSIBA2-77 we observed significantly increased dry weight of aboveground 
biomass and filled grains, increased number of filled grains, and significantly 
decreased root dry weight. Plant height, thousand-grain weight, number 

of panicles and tillers were not significantly altered in SUSIBA2-77. Four plants 


the protocol described in ref. 7, we demonstrated the SURE-binding 
activity in rice by HvSUSIBA2 (Extended Data Fig. 6). To investigate 
gene expression, we selected 24 genes"* associated with sugar metabol- 
ism, including HvSUSIBA2 and OsSUSIBA2-like. Gene expression 
analyses showed that HvSUSIBA2 was highly expressed in early devel- 
oping seeds and stems, and at very low levels in leaves, roots and late 
developing seeds (Fig. 3b, upper two panels). The expression pattern 
correlated with the activity of the HvSBEIIb promoter in rice 
(Extended Data Fig. 7). The expression level of selected genes, apart 
from the control gene TIP41-like, followed the HvSUSIBA2 expression 
pattern with significant differences between SUSIBA2 rice and Nipp in 
seeds (Fig. 3b and Extended Data Figs 8 and 9) and, at least for some 
genes, in stems but not in leaves or roots. (Fig. 3b, Extended Data Fig. 8 
and Supplementary Table 2). Following transcriptomic analysis, the 
same tissues of SUSIBA2-77 were subjected to protein expression ana- 
lysis. Protein analysis indicated that all five selected proteins could be 
found at higher levels in stems, but not in leaves and roots in SUSIBA2- 
77 compared with Nipp (Fig. 3c, upper panel). In developing seeds, the 
membrane protein (SUT5) and starch granule-bound protein (GBSSI) 
were expressed more in SUSIBA2-77 than in Nipp (Fig. 3c, lower 
panel). The same trend was not obvious for the soluble proteins. In 
vitro degradation experiments suggested that the unchanged levels of 
soluble proteins in SUSIBA2-77 compared with Nipp might represent 
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were used (n = 4). c, For SUSIBA2-77 starch content in mature seeds, 
developing seeds and stems at 14 daf changed significantly, but did not change 
in leaves and roots. Three plants (n = 3) were used. d, Starch granules from 
mature seeds. Scale bar, 10 um. One-way ANOVA was used for statistical 
analysis (*P = 0.05 or **P = 0.01, error bars show s.d.). DW, dry weight; FW, 
fresh weight. 


a higher turnover of soluble proteins in SUSIBA2-77 (Fig. 3d), sup- 
ported by a zymogram activity assay of a soluble phase of starch 
branching enzyme I (Fig. 3e). 

We have proposed a model for how SUSIBA2 rice works (Fig. 4). The 
HvSBEIIb promoter activity is sugar-inducible via the activation of 
HvSUSIBA2 (refs 7, 8). When photosynthates (sugars) are available, 
HvSBEIIb p:HvSUSIBA2 enhances sugar-inducible activities of targeted 
genes, including the HvSBEIIb p construct. The augmented gene express- 
ion increases sink strength in tissues where HVSUSIBA2 is expressed. The 
increased sink strength draws more sugars from source tissues to yield 
more biomass and more filled grains associated with higher starch con- 
tent. The increased allocation of photosynthates further activates 
HvSBEIIb p:HvSUSIBA2, causing a ‘snowball effect’ for increasing sink 
strength. The snowball effect was illustrated by the exogenous sugar 
treatment of leaves and field trial experiments. Expression levels of the 
sugar-inducible genes were significantly higher in SUSIBA2 rice than in 
Nipp when sucrose was available (Extended Data Fig. 10). Methane 
emission reduction was more effective during the summer than during 
the autumn, and at noon rather than in the morning or later afternoon 
(Fig. la and Extended Data Fig. 3). The data indicate that the higher 
temperature and enhanced sugar metabolism during the summer and at 
noon may favour the SUSIBA2-derived snowball effect and carbon 
allocation to aboveground biomass. As suggested in the model, less sugar 
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Figure 3 | Genotypic profiling of SUSIBA2 rice and Nipponbare (Nipp). 

a, Two copies of HvSUSIBA2 were detected in SUSIBA2-77 and SUSIBA2-80. 
b, Downstream effects of HvSUSIBA2 expression in SUSIBA2-77. High 
expression of HvSUSIBA2 enhanced expression of 12 genes in stems and early 
developing seeds, with a corresponding reduction of gene expression in 
roots. Three plants (n = 3) and technical triplicates per plant were used. 

c, Western blot analysis. SUSIBA2 total represents levels of HVSUSIBA2 and 
OsSUSIBA2-like. Protein levels increased in stems of SUSIBA2-77, but 


not in leaves and roots (upper panel). Protein levels also increased in seeds for 
membrane proteins (SUTS5) and starch granule-bound proteins (GBSSJ), but 
not for soluble proteins (lower panel). d, In vitro study of degradation 
efficiency of soluble proteins in SUSIBA2-77 seeds. A higher degradation 
efficiency was detected in SUSIBA2 rice. e, Zymogram of starch branching 
enzyme activity. Soluble branching enzyme I (BEI) activity increased in 
SUSIBA2-77. One-way ANOVA was used for statistical analysis (*P = 0.05 or 
** P< (),01, error bars show s.d.). 
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Figure 4 | Model depicting high-starch low-methane-emission SUSIBA2 
rice. Sugar-inducible HvSUSIBA2 expression generates a snowball effect that 
ultimately leads to a rice plant with enhanced starch accumulation in seeds 
and stems and decreased carbon allocation to roots, which reduces 
methanogenic growth and methane emissions. Red dots represent 
methanogens. 


availability in SUSIBA2 rice roots results in a rhizosphere with less 
organic biomass and hence less root exudates as nutrients for inhab- 
iting methanogenic consortia. As a consequence, methane emissions 
decreased. Due to the biomass differences between SUSIBA2 rice and 
Nipp, we do not exclude the possibility that the emission reduction is 
due in part to impaired methane transport in SUSIBA2 rice. 

Finally, we suggest that the use of SUSIBA2 rice in cutting methane 
emissions from paddies may become more relevant with global warm- 
ing. Increased temperatures accelerate methane emissions in all eco- 
systems including paddies", but they also favour SUSIBA2-derived 
carbon allocation to seeds and aboveground biomass, thus counter- 
acting the temperature-driven acceleration of rice paddy methane 
emissions. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


Plant materials and growth conditions. Rice plants of variety Nipponbare 
(Oryza sativa L. ssp. Japonica) and transformed homozygote lines SUSIBA2-77 
and SUSIBA2-80 were grown in open fields or in a phytotron. Open field cultiva- 
tion was performed in a similar way to that described previously'*, but with rice 
straw in soil under natural conditions in Fuzhou, Guangzhou and Nanning, 
southern China, respectively. Phytotron conditions were applied to mimic field 
conditions, but with limited high temperatures. In the phytotron, rice plants were 
grown in cylinder-type pots (30cm high with an upper diameter of 29cm and 
bottom diameter of 19 cm) with organic soil containing plant residues. Phytotron 
growth management was similar to that described previously’? with a modified 
setting for rice, 14h light/10h dark at 30 °C/21 °C, a constant relative humidity of 
80% and light intensity of 400 jumol photons m~*s~* 

Plasmid construction and rice transformation. Plasmid construction and gen- 
eral molecular cloning procedures were performed according to previously 
developed protocols’*”°. Nucleotides 247-2067 of GenBank accession number 
AY323206, encoding barley SUSIBA2, were fused to nucleotides 1-1010 of barley 
SBEIIb promoter (HvSBEIIb p; GenBank accession number AF064563). The fused 
DNA fragment was cloned in the pCAMBIA 1301 binary vector (Extended Data 
Fig. 1a) and sequenced at Macrogen Europe (Amsterdam, the Netherlands). The 
sequence of the construct is in the Supplementary Information and deposited 
under GenBank accession number KR935231. The plasmid construct was used 
for Agrobacterium-mediated transformation of rice following the protocol in 
ref. 21. Screening of post-transformants was based on hygromycin resistance 
and PCR determination of T-DNA insertion. Out of 14 positive lines, five homo- 
zygous lines were selected for characterization. Eventually, two homozygote lines 
SUSIBA2-77 (Extended Data Fig. 1b) and SUSIBA2-80 were used for detailed 
studies. A binary vector containing HvSBEIIb p:GUS was also constructed and 
transformed to Nipponbare (Extended Data Fig. 7). The final construct was veri- 
fied by DNA sequencing, and transformed into Agrobacterium tumefaciens 
strain EHA105. 

Gene expression analysis by quantitative PCR (qPCR). RNA isolation, CDNA 
synthesis and qPCR analysis were performed in accordance with previous 
reports*’*. In brief, plant materials from different tissues were ground into fine 
powders in liquid nitrogen and total RNA was isolated by the Spectrum Plant Total 
RNA Kit (Sigma-Aldrich) according to the manufacturer’s protocol using 
approximately 30 mg of plant materials. All samples were treated with DNase I 
(Sigma-Aldrich) to remove trace amounts of DNA contamination. Total RNA of 
1 pg was used as a template for the cDNA synthesis with the Quanta qScript CDNA 
synthesis kit (Quanta Biosciences). The synthesized cDNA was adjusted to a 
concentration of 5 ng jl” ' and 15 ng used for qPCR analysis. qPCR reactions with 
at least 90% amplification efficiency were performed in a volume of 20 pl contain- 
ing 54M specific primers and a SYBR Green PCR master mix (Applied 
Biosystems, Life Technologies Europe BV). The PCR programme consisted of 
an initial temperature of 95°C for 4 min, and then 40 cycles of 30s at 95 °C and 
30s at 60°C. The melt curve was performed by increasing the temperature from 
60°C to 95 °C at a speed of 0.05 °C per second. qPCR-specific amplification was 
verified by a single band product in gel analysis. Data were calculated with the 
comparative C,; method'* and one-way ANOVA" was used for statistical analysis. 
The gene expression level by qPCR was normalized using housekeeping genes 
ACT1I (ref. 22) in developing seeds and Ubiquitin10 (ref. 22) in vegetative tissues, 
and using TIP41-like” as a control gene. 

Electrophoretic mobility shift assay (EMSA). Overexpressed HvSUSIBA2 pro- 
tein from E. coli was purified and used for EMSA. EMSA was performed essentially 
as described in ref. 7. The SURE sequences in the rice ISA promoter were found 
by manual search in the promoter sequence’’. The binding assay of HvSUSIBA2 
protein to labelled SURE oligonucleotides was performed by incubating the pro- 
tein with a DNA probe at room temperature for 30 min and then visualizing a 
protein-DNA complex ona native 5% polyacrylamide gel. The gel electrophoresis 
was conducted at 200 V for 2h. A DNA fragment of ‘A-rich stretch’ was used as a 
negative control. The gel was dried and autoradiography was carried out on an 
X-ray film overnight. 

Exogenous sucrose induction. Leaf blades were excised from rice plants after 10h 
in the dark during the early tillering stage and incubated with 100 mM sucrose 
solutions in a falcon tube (1 blade per tube) containing 2 ml sucrose solution for 
24h at 22 °C in the dark. Immediately after collection, the leaf blades were frozen 
in liquid nitrogen and stored at —70°C until further analysis. 

Validation of T-DNA insertion sites. The genomic DNA of SUSIBA2-77 was 
isolated using a DNA isolation kit (Qiagen GmbH, Hilden, Germany). The geno- 
mic DNA was fragmented by HindIII and ligated toa DNA adaptor also generated 
by HindIII. The HindIII-digested adaptor was purified before ligation using a PCR 
product purification kit (Qiagen). PCR cloning was performed using the ligated 
template to obtain a specific PCR product with the primers against T-DNA 
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board sequences and the adaptor sequences. The resulting PCR products were 
then sub-cloned into a PCR cloning vector by the TOPO10 TA cloning kit (Life 
Technologies Europe BV) for sequencing. 

qPCR quantification of methanogenic communities. Soil or root samples from 
three independent positions in the underground vicinity around the root tip and 
proximal regions (5cm from the rice plant and 5 cm depths, and 5 cm between 
positions) of three independent plants for Nipp, SUSIBA2-77 and SUSIBA2-80 
were collected at 3.00 p.m. from rice paddies or phytotrons, and the DNA was 
subsequently isolated using a DNA isolation kit for soil organisms (Fast DNA SPIN 
Kit for Soil; MP Biomedicals, LLC). The DNA was then diluted into four different 
series concentrations. qPCR quantification was performed for all four diluted 
DNA samples using a standard of a previously cloned 16S rRNA gene frag- 
ment”*”® for the individual groups of targeted methanogens and a newly PCR 
cloned 16S rRNA gene fragment using described primers’ from the strain 
Methanocella conradii DSM 24694 (German Collection of Microorganisms and 
Cell Cultures GmbH, Germany) for the genera of Methanocella. Only DNA copy 
numbers that were in agreement from two series of diluted samples were used for 
copy number determination. All methanogenic groups were analysed by using 
group-specific primers (Supplementary Table 1) with at least 90% amplification 
efficiency. Existing PCR products in all qPCR reactions were verified by gel ana- 
lysis, showing a single band of the expected size. The abundance of each group was 
calculated and translated to DNA copy numbers for each gram of dry root and/or 
soil. The abbreviations for each group are Mst (Methanosaetaceae), Msc 
(Methanosarcinaceae), MBT (Methanobacteriales), MMB (Methanomicrobiales), 
ARC (archaea) and MET (methanogens) according to ref. 26, and Met for 
Methanocella-specific (Supplementary Table 1). The qPCR programme for Mst 
was as follows: 95°C for 7 min, then 54 cycles of 40s at 95°C, 1 min at 61°C and 
40 s at 72 °C; for MBT: 95 °C for 7 min, followed by 54 cycles of 40 s at 95 °C, 1 min at 
58 °C and 40s at 72 °C; for MMB: 95°C for 7 min, followed by 54 cycles of 40s at 
95 °C, 1 min at 66°C and 40s at 72°C; for Msc, ARC and MET: 95 °C for 7 min, 
followed by 54 cycles of 40s at 95°C, 1min at 60°C and 40s at 72°C; for 
Methanocella-specific (Met) analysis: 94°C for 4 min, followed by 40 cycles of 30s 
at 94 °C, 1 min at 60 °C. All melting curves are from 55 °C to 95 °C with an increase of 
0.05 °C per second. 

Autofluorescence analysis of methanogens. Rice root tip and proximal regions 
at/after 28 days after flowering were picked at 3.00 p.m. and rinsed with pure water 
and then examined under a microscope at an excitation light wavelength of 
420 nm. Multiple observations on different batches of rice plants were performed. 
Southern blot analysis. Southern blot analysis was performed according to a 
previous protocol”. Briefly, rice genomic DNA was isolated from leaves using a 
CTAB method and 10 jig DNA was used for restriction digestion at 37 °C for 2h 
with BamHI. Digested DNA was applied to agarose gel separation immediately 
with an electrophoresis condition of 50 V for about 7 h. The separated DNA was 
denatured by an alkaline solution and transferred to the Hybond-N membrane 
overnight through capillary blotting with 10 SSC solution, and then the DNA 
was cross-linked to the membrane by UV-light for 5 min. The hybridization was 
performed overnight with a probe labelled with «-*’P dCTP using a random prime 
labelling kit (rediprime II, GE Healthcare) in a volume of 20 ml hybridization 
solution at 42 °C. After hybridization, the membrane was washed under a mod- 
erate stringent condition, 1X SSC at 50 °C. Autoradiography was conducted with 
an X-ray film on the membrane. 

Western blot analysis. Western blot analysis was performed as described 
previously’. Five proteins were selected for analysis: a membrane protein 
(sucrose transporter 5, SUTS), a starch granule-bound protein (granule-bound starch 
synthase I, GBSSI), and three soluble/soluble-phase proteins (SUSIBA2 total = barley 
HySUSIBA2 and rice OsSUSIBA2-like; UDP-glucose pyrophosphorylase 1, UGP1; 
branching enzyme I, BEI), respectively. Peptide antibodies against the five proteins 
were obtained from the Beijing Genomics Institute. The antibodies were raised 
against peptide positions of 300-330, 510-540, 440-470, 260-290 and 480-510 of 
GenBank accession numbers NM_001066651, DQ072593, DQ395328.1, X62134 
and D11082, for OsSUSIBA2-like, SUT5, UGP1, GBSSI and BEI, respectively. Rice 
tissue samples were collected at 3.00 p.m. and ground into fine powder. Total proteins 
were extracted from approximately 200 mg samples using the plant total protein 
extraction kit (Sigma-Aldrich, St. Louis, MO, US). The proteins were separated in 
a 4-12% gradient PAGE gel under a condition of 150 V for 1.5-2h, and then 
transferred to PVDF membrane by electro-blotting. The PVDF membrane was 
blocked in 5% milk in a TBS buffer for 1 h, and then incubated with the first antibody 
in the blocking solution overnight at room temperature. Incubation with the second 
antibody conjugated with a phosphatase was performed for approximately 2h after 
the membrane was washed by TBST and TBS solutions. Immuno-reacted bands were 
visualized in a BCIP/NBT (substrates for the phosphatase) solution for colour 
development. 
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Methane collection in rice paddies and phytotrons. Methane sampling and 
determination were based on published protocols” ”’. Individual rice plants were 
covered with a sealed plastic cylinder (diameter: 15 cm, height: 45 cm for early 
stages and 95cm for late stages) and 3 X 25 ml gas samples were taken from six 
independent plants by a syringe from the headspace after 10, 20 and 30 min and 
pooled in a sealed airbag. Sampling time in rice paddies was 8.00 a.m. for the 
summer (June-August) of 2012 and 2013, and 8.00 a.m., 12.00 p.m. and 4.00 p.m. 
for the autumn (September-November) of 2014, respectively. Temperatures were 
recorded. The samples in the phytotrons were taken from six independent plants 
at 3.00 p.m. after 15 min coverage for each plant. Four technical repeats, that is, 
four vials, were used for each plant methane collection. All gas samples were 
analysed by gas chromatography with appropriate methane standards. Methane 
in samples was quantified by calculating the sample peak area in comparison with 
the standard peak area of known amounts. An air methane concentration of 1.8 
p-p.m. was used as the background for calculations. Methane flux calculation was 
according to the formula described by ref. 29. 

Degradation efficiency analysis. Soluble proteins were extracted from the seeds 
at 7 daf without the addition of protease inhibitors according to a protocol 
described previously”*. The protein extracts were adjusted to the same concentra- 
tion at time = 0 and incubated at 28° C for 2 or 4h. Western blot analysis was used 
to check the remaining protein levels. 

Total starch analysis. Total starch was extracted and analysed using a total starch 
assay kit (Megazyme, Bray, Co. Wicklow, Ireland). Rice tissues were collected at 
3.00 p.m. and ground into fine powder in liquid nitrogen and 100 mg samples of 
tissues were used for total starch analysis after removing the soluble sugars with 
80% ethanol. Total starch quantification and calculation followed the manufac- 
turer’s protocol exactly. 

Zymogram analysis of starch branching enzyme (SBE) activity. Zymogram 
analysis was performed according to a protocol described previously*. Equal 
amounts of total soluble proteins from Nipp and SUSIBA2-77 developing seeds 
at 7 daf were separated in a 10% (w/v) polyacrylamide gel containing 1% (w/v) 
starch. After electrophoresis, the gel containing starch branching enzymes was 
washed with water and incubated at 30 °C in a buffer containing 100 mM sodium 
citrate (pH 7.0) for 4-8 h. The gel was then transferred to an I,/KI solution to 
visualize reddish-purple bands against the dark blue starch gel for SBE activity. 
Scanning electron microscopy of starch granules. Starch granules of 
Nipponbare and SUSIBA2-77 were isolated as described*. Coating of the granules 


was carried out in a high-resolution sputter coater, with platinum/palladium as the 
target. Scanning electron microscopy was performed on a JSM-6320F (JEOL Ltd, 
Akishima, Tokyo, Japan). 

Molecular cloning of Methanocella-specific 16S rRNA genes. Genomic DNA of 
Methanocella conradii DSM 24694 were isolated by the DNA isolation kit for soil 
organisms (FastDNA SPIN Kit for Soil; MP Biomedicals, LLC) and used as PCR 
templates for PCR-based cloning of 16S rRNA. PCR-amplified products were sub- 
cloned into a PCR cloning vector by the TOPO10 TA cloning kit (Life 
Technologies Europe BV). The cloned genes were verified by DNA sequencing. 
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Extended Data Figure 1 | Validation of an expression cassette containing was performed by PCR-based cloning using primers and HindIII-adaptor 

barley SBEIIb promoter and barley SUSIBA2 (HvSBEIIb p:HvSUSIBA2) in _ ligation, followed by sequencing. Two insertion sites were identified in rice 

a binary vector and rice genome. a, Construction of an expression cassette chromosome 2, from nucleotides 57443 to 57444 and 57447 to 57448 
(GenBank accession number AP006069), respectively. 


containing HvSBEIb p:HvSUSIBA2 in a binary vector was performed as 
described in the Methods. b, Validation of the construct in the rice genome 
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Extended Data Figure 2 | Methane emissions of SUSIBA2 rice compared concentrations at 30 and 60 min was found, as presented by similar methane 
with Nipponbare (Nipp) in phytotrons. a, Methane emission of Nipp, fluxes determined from the different time points. Three plants (n = 3) from 
SUSIBA2-77 and SUSIBA2-80 rice (15 daf). Six independent plants (” = 6) each rice line were used for time point measurements. A statistically significant 


from each rice line were used for measurements. b, Methane emission of Nipp _ reduction of methane emission in SUSIBA2 rice is indicated (one-way 
and SUSIBA2-77 (28 daf) at 30 min (left panel) or 60 min (right panel) after ANOVA, **P <0.01 or *P $0.05, error bars show s.d.). 
plants were covered. A linear relationship over time for the measured methane 
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a CH, emissions from Nipp, SUSIBA2-77 and SUSIBA2-80 during the autumn of 2014 in Fuzhou (26°00'N, 119°18’E) 
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Extended Data Figure 3 | Diurnal and seasonal methane emissions from 
SUSIBA2-77 and SUSIBA-80 rice compared with Nipponbare (Nipp) in 
autumn 2014 at three sites in China. Methane emissions of six independent 
plants (n = 6) from three time points during the day (morning 8.00 a.m., 
noon 12.00 p.m. and afternoon 4.00 p.m.) on different dates are presented. Key 
rice development stages for the corresponding dates are indicated. Time points 


for sampling soil and roots for methanogen analysis are indicated by red 
arrows. a, Methane emission from Fuzhou. b, Methane emission from 
Guangzhou. c, Methane emission from Nanning. Reported statistically 
significant reduction of methane emission in SUSIBA2 rice is indicated 
(one-way ANOVA, **P = 0.01 or *P = 0.05, error bars show s.d.). 
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Extended Data Figure 4 | qPCR quantification of rhizospheric 
methanogenic communities associated with SUSIBA2-77 rice and Nipp in 
phytotrons. Soil and root samples from three independent positions, positions 
1-3 (n = 3), in the underground vicinity close to the root tip and proximal 
regions of three independent plants (n = 3) for Nipp and SUSIBA2-77 rice were 
collected and analysed. Technical triplicates per position were applied. Six 
pairs of primers (Supplementary Table 1) were used to quantify total archaea 


(ARC) and methanogens (MET), and the orders Methanobacteriales (MBT), 
Methanomicrobiales (MMB) and Methanocellales and two families 
Methanosaetaceae (Mst) and Methanosarcinaceae (Msc) of the order 
Methanosarcinales, respectively. Results from soil and root samples are shown 
in aand b, respectively. Existing numbers of all methanogenic groups and total 
archaea were significantly reduced in the SUSIBA2 rhizosphere compared to 
that of Nipp (one-way ANOVA, **P = 0.01 or *P = 0.05, error bars show s.d.). 
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Extended Data Figure 5 | qPCR quantification of rhizospheric Methanomicrobiales (MMB) and Methanocellales and two families 
methanogens associated with SUSIBA2 rice and Nipponbare (Nipp) Methanosaetaceae (Mst) and Methanosarcinaceae (Msc) of the order 
from rice paddies. Soil and root samples (a mixture) from three positions Methanosarcinales, respectively. Primers (Supplementary Table 1) specific to 
(n = 3) close to the root tip and proximal regions of three independent plants | Methanocella were also used for quantification. a, Methanogenic communities 
(n = 3) for Nipp, SUSIBA2-77 and SUSIBA2-80 rice, respectively, were in samples from Fuzhou. b, Methanogenic communities in samples from 
collected. The sampling time and sites are indicated in Extended Data Fig. 3. Nanning. c, Methanocella in samples from Nanning. All methanogenic groups 
Technical triplicates per position were applied. Six pairs of primers and total archaea were significantly reduced in the SUSIBA2 rice rhizosphere 
(Supplementary Table 1) were used to quantify total archaea (ARC) and compared with Nipp (one-way ANOVA, **P = 0.01 or *P = 0.05, error bars 
methanogens (MET), and the orders Methanobacteriales (MBT), show s.d.). 
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Extended Data Figure 6 | Binding activity of HYSUSIBA2 to SURE negative (‘A stretch’) control was included. b, Barley SUSIBA2 protein 
sequences in the rice ISAI promoter. a, Three SURE sequences in the rice (HvSUSIBA2) was overexpressed from E. coli and used for electrophoretic 


ISA1 promoter (GenBank accession number AB093426) were used and a mobility shift assay (EMSA). 
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Extended Data Figure 7 | HvSBEIIb promoter activity analysis in rice leaves. a, Nipp leaf. b, Transformant leaf. c, Nipp stem. d, Transformant stem. 
seedlings. a, HvSBEIIb p:GUS was introduced in Nipponbare (Nipp). GUS e, Nipp root. f, Transformant root. g, Nipp leaf induced by 100 mM sucrose. 
activity was stained in different tissues of transformant and Nipp lines. The h, Transformant leaf induced by 100 mM sucrose. Scale bars, 2 mm. 


GUS activity was found in transformant stems and induced by sucrose (Suc) in 
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Extended Data Figure 8 | Transcriptomic analysis of genes related to sugar _ performed for each rice line. SUT3 and SUS5/7 transcripts were not detected in 
metabolism in late tillering plants of SUSIBA2 rice and Nipponbare (Nipp). _ all samples (not shown) and HvSUSIBA2 expression was similar in SUSIBA2-77 
Relative expression levels of 23 genes together with a housekeeping gene, TIP41- and SUSIBA2-80 rice, as presented in Fig. 3b. One-way ANOVA was used 
like, were analysed by qPCR and compared between Nipp and SUSIBA2-77 for statistical analysis (*P = 0.05 or **P = 0.01, error bars show s.d.). ND, not 
and SUSIBA2-80 rice in leaves, stems and roots at late tillering stageandinseeds _ detected. 

at 7 daf. Three plants were used (n = 3) and technical triplicates were 
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Extended Data Figure 9 | Transcriptomic analysis of 23 genes related to 
sugar metabolism in developing rice seeds of SUSIBA2 rice and Nipponbare 
(Nipp). Relative expression levels of 23 genes together with a housekeeping 
gene, TIP41-like, were analysed by qPCR and compared between SUSIBA2-77 


rice and Nipp at 5, 7, 9, 11 and 14 daf. Seeds from three independent plants 
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were used (n = 3) and technical triplicates were performed. SUT3 and SUS5/7 
transcripts were not detected in all samples (not shown) and HvSUSIBA2 
expression is presented in Fig. 3b. One-way ANOVA was used for statistical 
analysis (*P = 0.05 or **P = 0.01, error bars show s.d.). 
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Extended Data Figure 10 | Sugar-induction in Nipponbare (Nipp) and either rice cultivar under dark conditions) in SUSIBA2-77 was observed 


SUSIBA2 rice. Rice leaves of Nipp and SUSIBA2-77 were depleted for sucrose compared with Nipp. One-way ANOVA was used for statistical analysis 

in the dark before induction with 100 mM sucrose. Sugar induction was carried (*P=0.05 or **P = 0.01, error bars show s.d.). The experiment was repeated at 
out in the dark for 24 h. An enhanced expression of HvSUSIBA2-regulated least three times (n = 3). 

genes (9 of the 12 genes except SUT5, SUSI and GBSSI that were not detected in 
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Lanosterol reverses protein aggregation in cataracts 


Ling Zhao!'*3*+, Xiang-Jun Chen**, Jie Zhu*°*, Yi-Bo Xi**, Xu Yang°*, Li-Dan Hu**, Hong Ouyang”®, Sherrina H. Patel’, 
Xin Jin®, Danni Lin’, Frances Wu’, Ken Flagg’, Huimin Cai’’, Gen Lil, Guiqun Caol, Ying Lin??, Daniel Chen?, 

Cindy Wen*, Christopher Chung’, Yandong Wang’, Austin Qiu**, Emily Yeh*?, Wenqiu Wang*’, Xun Hu', Seanna Grob’, 
Ruben Abagyan’®, Zhiguang Su’, Harry Christianto Tjondro*, Xi-Juan Zhao*, Hongrong Luo’, Rui How’, J. Jefferson P. Perry", 
Weiwei Gao*!?, Igor Kozak'’, David Granet’, Yingrui Li®, Xiaodong Sun’, Jun Wang®, Liangfang Zhang*"’, Yizhi Liu’, 
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The human lens is comprised largely of crystallin proteins assembled 
into a highly ordered, interactive macro-structure essential for lens 
transparency and refractive index. Any disruption of intra- or 
inter-protein interactions will alter this delicate structure, expos- 
ing hydrophobic surfaces, with consequent protein aggregation 
and cataract formation. Cataracts are the most common cause of 
blindness worldwide, affecting tens of millions of people’, and 
currently the only treatment is surgical removal of cataractous 
lenses. The precise mechanisms by which lens proteins both 
prevent aggregation and maintain lens transparency are largely 
unknown. Lanosterol is an amphipathic molecule enriched in the 
lens. It is synthesized by lanosterol synthase (LSS) in a key cycliza- 
tion reaction of a cholesterol synthesis pathway. Here we identify 
two distinct homozygous LSS missense mutations (W581R and 
G588S) in two families with extensive congenital cataracts. Both 
of these mutations affect highly conserved amino acid residues and 
impair key catalytic functions of LSS. Engineered expression of wild- 
type, but not mutant, LSS prevents intracellular protein aggregation 
of various cataract-causing mutant crystallins. Treatment by lanos- 
terol, but not cholesterol, significantly decreased preformed protein 
aggregates both in vitro and in cell-transfection experiments. We 
further show that lanosterol treatment could reduce cataract severity 
and increase transparency in dissected rabbit cataractous lenses 
in vitro and cataract severity in vivo in dogs. Our study identifies 
lanosterol as a key molecule in the prevention of lens protein 
aggregation and points to a novel strategy for cataract prevention 
and treatment. 

Cataracts account for over half of all cases of blindness worldwide, 
with the only established treatment involving surgical removal of the 
opacified lens. In developed nations, cataract surgeries amount to a 
significant portion of healthcare costs owing to the sheer prevalence of 
the disease among ageing populations. In addition, there is major 
morbidity associated with cataracts in developing countries, where 
there is limited access to surgical care. 

High concentrations of crystallin proteins in lens fibres contribute 
to lens transparency and refractive properties’. The crystallin super- 
family is composed of «-, B- and y-crystallins, which are some of the 
most highly concentrated intracellular proteins in the human body. 
Protein aggregation is the single most important factor in cataract 
formation’. Factors that lead to protein aggregation include muta- 
tions in crystallin proteins, which are known to cause congenital 
cataracts, or oxidative stress, which in turn contributes to age- 
related cataracts. However, the precise mechanisms by which lens 


proteins maintain transparency or cause opacification are not com- 
pletely understood. 

Lanosterol synthase (2,3-oxidosqualene-lanosterol cyclase, LSS; EC 
5.4.99.7) is encoded by the LSS gene. The LSS protein catalyses the 
conversion of (S)-2,3-oxidosqualene to lanosterol, which is a key early 
rate-limiting step in the biosynthesis of cholesterol, steroid hormones, 
and vitamin D (ref. 4). LSS was found to be expressed in the lens’. It 
was previously reported that the specific combination of hypomorphic 
mutations on LSS and FDFT1 (farnesyl diphosphate farnesyl transfer- 
ase 1) could decrease cholesterol levels in the lens and result in catar- 
acts in Shumiya cataract rats (SCR)°. Here we identify novel 
homozygous mutations in the LSS gene in two consanguineous fam- 
ilies and investigate the ability of lanosterol to alleviate protein 
aggregation and cataract formation. 

We identified three children with severe congenital cataract from a 
consanguineous family of Caucasian descent (Fig. 1a). We performed 
whole-exome sequencing to an average of no less than 55-fold 
depth coverage on the target region (Extended Data Table 1a) in order 
to identify the causal mutation. On average, ~60,800-80,800 SNPs 
were detected in each exome (Extended Data Table 1b). Using a con- 
sanguineous recessive model and filtering against common variants 
(minor allele frequency >0.5%) in public databases, including dbSNP 
and the 1000 Genomes Project, as well as mutation function predic- 
tions (predicted by SIFT’, Polyphen2°, Phylop’ and Mutationtaster’’), 
we narrowed down potential candidate gene variants and identified 
a variant (G588S) in LSS on chromosome 21 as the most likely can- 
didate (Extended Data Table 1c). Three affected children were homo- 
zygous for the G—A transition (G588S) in LSS, (GRch37/hg19: 
chr21:47615645; NM_001001438.2:c.1762G > A, NM_001001438.1: 
p.G588S8), while the unaffected father, mother and remaining child 
were heterozygous for the change (Fig. la, b). Whole-genome SNP 
genotyping identified three long continuous homozygous regions in 
this family by HomozygosityMapper” (chr2:q22.1-q24.1, chr2:q31.1- 
q32.1 and chr21:q22.3; Extended Data Fig. 1a and Extended Data Table 
1d). The LSS gene was located in one of the homozygous regions on 
chromosome 21 (Extended Data Fig. 1b). Furthermore, we screened for 
mutations in the LSS gene in 154 families with congenital cataracts and 
identified another homozygous mutation, W581R (GRch37/hg19: 
chr21:47615666; NM_001001438.2:c.1741T >C, NM_001001438.1: 
p.W581R), in a second consanguineous family (Fig. 1a, b, c). These 
two mutations were absent in 11,000 control chromosomes. 

The amino acid residues W581 and G588 in LSS are highly con- 
served (Fig. 2a). We performed computational modelling analysis to 


Molecular Medicine Research Center, State Key Laboratory of Biotherapy, West China Hospital, Sichuan University, Chengdu 610041, China. *State Key Laboratory of Ophthalmology, Zhongshan 
Ophthalmic Center, Sun Yat-sen University, Guangzhou 510060, China. ?Department of Ophthalmology and Biomaterials and Tissue Engineering Center, Institute for Engineering in Medicine, University of 
California San Diego, La Jolla, California 92093, USA. “State Key Laboratory of Membrane Biology, School of Life Sciences, Tsinghua University, Beijing 100084, China. "Department of Ophthalmology, 
Xijing Hospital, Fourth Military Medical University, Xi'an 710032, China. °BGI-Shenzhen, Shenzhen 518083, China. ’Guangzhou KangRui Biological Pharmaceutical Technology Company, Guangzhou 
510005, China. ®CapitalBio Genomics Co., Ltd., Dongguan 523808, China. ?Department of Ophthalmology, Shanghai First People’s Hospital, School of Medicine, Shanghai JiaoTong University, Shanghai 
20080, China. !°Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, La Jolla, California 92093, USA. '!Department of Biochemistry, University of California 
Riverside, Riverside, California 92521, USA. !*Department of Nanoengineering, University of California, San Diego, La Jolla, California 92093, USA. !?King Khaled Eye Specialist Hospital, Riyadh, Kingdom of 
Saudi Arabia. !4Veterans Administration Healthcare System, San Diego, California 92093, USA. +Present address: Institute of Molecular Medicine, Peking University, Beijing 100871, China. 


*These authors contributed equally to this work. 


30 JULY 2015 | VOL 523 | NATURE | 607 


©2015 Macmillan Publishers Limited. All rights reserved 


LETTER 


\ 
\ n\n 


f 
Wild type 1 
CCTGGGAAGGCTCCTGGGGAGTTTGCTTCA 


! 


W581R A \\ Ww 


CCTGGGAAGGCTCCCGGGGAGTTTGCTTCA 


G588S/+ 


Wild type NANA ANW\ | WL M 


G588S/G588S G588S/G588S G588S/G588S G588S/+ } 


GITTGCTTCACCTACGGCACCTGGTTTGGC 
1 2 = 


tainty 


W581R/+ G588S 
GTTTGCTTCACCTACAGCACCTGGTTTGGC 
— 
4 
I 
W581R/W581R W581R/+ 
Patient 2 


c Patient 1 


Figure 1 | Identification of mutations in LSS causing congenital cataracts. 
a, Pedigrees of affected families and cataract phenotype. Squares and circles 
indicate males and females respectively. +, wild-type allele; W581R and 
G588S are the two mutations. b, Upper panel, DNA sequencing data of an 
unaffected individual and an affected child (II-1) with a homozygous W581R 
mutation; lower panel, DNA sequencing data of an unaffected individual 
and an affected child (IV-1) with a homozygous G588S mutation. The 
underlined sequence indicates the nucleic acid change. c, Left, colour 
photograph of patient 1’s right eye in the first pedigree (IV-1) with a total 
cataract; right, colour photograph of patient 2’s right eye in the same pedigree 
(IV-3) with a cataract. 


investigate the effects of the W581R and G588S mutations on the 3D 
structure and function of LSS. The amino acid tryptophan at position 
581 has been reported to contribute to the catalytic site of the cyclase 
activity’*. The G588S mutant was modelled by in-place replacement 
followed by side-chain refinement. The $588 side-chain refinement 
could not resolve the van der Waals clash between the serine side chain 
and the backbone carbonyl of E578, which forms a key salt bridge with 
R639. The orientation of the E579:C584 loop needed to be distorted to 
accommodate the mutation. The side chain of the mutant $588 clashed 
into an adjacent loop, indicating that the mutation was incompatible 
with the normal enzymatic structure and function of LSS (Fig. 2b). 
Supporting the in silico results, expression of wild-type LSS in a cell- 
transfection experiment exhibited cyclase activity and dramatically 
increased the amount of lanosterol production in the lipid fraction 
in HeLa cells, while neither the W581R nor the G588S mutant protein 
demonstrated any cyclase activity (Fig. 2c). 

In contrast, the cholesterol level was unaffected by the expression of 
wild-type or mutant LSS, suggesting that there may be an alternative 
pathway for cholesterol homeostasis. The W581R and G588S muta- 
tions did not alter subcellular localization or cause aggregates of LSS 
protein when compared to that of wild-type LSS, suggesting that the 
cataract phenotype was not due to the formation of light-scattering 
particles by mutant LSS proteins themselves (Extended Data Fig. 2). 

The aggregation of crystallins, the major structural proteins in the 
lens, is a predominant cause of various types of cataracts*. To mimic 
protein aggregation in the cataractous lens, six known cataract-causing 
mutant crystallin proteins were expressed in human lens progenitor 
cells, human lens epithelial line B-3 (HLEB-3), or HeLa cells. These 
mutant crystallins formed p62-positive inclusion bodies/aggresomes 
in all three transfected cell lines, suggesting that aggregation is an 
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Figure 2 | LSS mutations abolished the cyclase enzymatic function. 

a, Conservation of W581R and G588 in LSS across several species: Homo 
sapiens, Pan troglodytes, Bos taurus, Mus musculus, Rattus norvegicus, Gallus 
gallus and Danio rerio. b, Computer modelling of LSS structure and impact of 
the LSS W581R and G588S mutations. A computer modelling analysis 
identifies a loop originating from C584 and ending at E578 with the key side 
chain of W581 at the tip of the loop stabilizing the sterol. The loop is fixed by 
an S-S bridge and the E578-R639 salt bridge. Amide nitrogen N of G588 
interacts with the C584 from the previous helical turn and the Cx hydrogen of 
G588 is in close proximity to the critical E578, which then forms a strong 
salt bridge with R639 of the same supporting helix. The mutation G588S causes 
the side chain of the serine to clash into the E578 residue of the loop and is 
incompatible with the structure. Arrow indicates the location of the mutant side 
chain. ¢, Effect of engineered expression of the wild-type protein (WT LSS) 
and LSS mutants on sterol content. Wild-type LSS markedly increased 
lanosterol production, whereas neither W581R nor the G588S mutant 
exhibited any cyclase activity. n = 3 in each group; ***P < 0.001. 


intrinsic property of mutant crystallins (Fig. 3a and Extended Data 
Figs 3 and 4)'*. Co-expression of wild-type LSS and a cataract-causing 
mutant crystallin protein significantly reduced both the number and 
size of intracellular crystallin aggregates, whereas LSS mutants failed to 
do so alone (Fig. 3b, c and Extended Data Figs 3 and 4). Western blot 
analysis indicated that the Y118D mutant of «A-crystallin was released 
from intracellular aggregates and became more soluble with wild- 
type LSS (Fig. 3d and Extended Data Fig. 4c). Furthermore, addition 
of lanosterol, but not cholesterol, in the culture medium of cells co- 
expressing an LSS mutant and a mutant crystallin successfully reduced 
crystallin aggregation (Fig. 3c and Extended Data Figs 3 and 4). This 
result indicated that lanosterol, but not cholesterol, could be an effec 
tive agent to release mutant crystallin proteins from aggregation. 
Supporting this hypothesis, lanosterol significantly inhibited aggre- 
some formation of both wild-type and mutated crystallin proteins in 
a concentration-dependent manner, while cholesterol had no effect 
(Fig. 3e, fand Extended Data Fig. 5). We further showed that lanosterol, 
but not cholesterol, increased the amounts of mutant crystallins in the 
soluble fractions of cell lysates (Fig. 3g and Extended Data Fig. 6a). 
Using serial live-cell imaging of cells expressing a GFP-fused Y118D 
mutant of «A-crystallin, we showed that addition of lanosterol could 
effectively diminish crystallin aggregates with a half-life of 222 + 8 min 
(Fig. 3h), whereas addition of DMSO or cholesterol did not reduce 
aggresome formation (Extended Data Fig. 6b). Single-particle tracking 
in live cells clearly showed that lanosterol has an important role in the 
dissociation of pre-formed intracellular protein aggregates. 

To investigate whether lanosterol has a direct effect on the dissolu- 
tion of aggregated proteins, the aggregates of five wild-type and nine 
mutant crystallins were obtained by heating wild-type and mutated 
crystallins in the presence of 1 M guanidine chloride. Under this con- 
dition, all crystallin proteins formed amyloid-like fibrils as revealed 
by the enhancement of thioflavin T (ThT) fluorescence, the fibrillar 
structures under negatively stained transmission electron microscopy 
(TEM), and the low turbidity value (Fig. 4 and Extended Data Fig. 6c). 
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Figure 3 | Lanosterol reduced intracellular aggregation of various crystallin 
mutant proteins. a, Confocal images of crystallin protein aggregates in human 
lens progenitor cells. The cataract-causing Y118D mutant of «A-crystallin 
formed p62-positive intracellular inclusion bodies or aggresomes. Green, 
eGFP-crystallin proteins; red, p62; blue, nuclei. Cells transfected with peGFP- 
N1 were used as a control. b, Confocal images of the inhibitory effect of LSS 
on crystallin aggregates. c, Inhibition of crystallin mutant aggregation by 
wild-type LSS (WT LSS) and lanosterol, but not mutant LSS or cholesterol. 
d, Increase in soluble «A-crystallin(Y118D) mutant protein by co-expression of 
wild-type LSS but not LSS mutants (Y118D co-expressed with pcDNA3.1-N- 
Flag was used as a control). Quantitative analysis was performed using 
densitometry of crystallin proteins by western blot analysis of the supernatant 
or insoluble fraction of cell lysates. n = 3 in each group; representative 
western blot analysis is shown in Extended Data Fig. 4c; *P < 0.05, **P < 0.01. 
e, Confocal images of the re-dissolution of preformed crystallin aggregates 


The morphology of the amyloid-like fibrils obtained here was similar 
to those crystallin proteins reported previously'’*. We used PBS- 
containing liposomes formed by dipalmitoyl phosphatidylcholine 
(DPPC) to increase the solubility of sterol compounds and mimic 
the condition of sterols in cell membranes. Lanosterol, but not choles- 
terol, successfully re-dissolved the aggregated crystallin proteins from 
the amyloid-like fibrils in a concentration-dependent manner as 
indicated by the disappearance of fibrillar structures in the negatively 


by lanosterol. Arrows indicate the presence of crystallin aggregation. 

f, Lanosterol significantly reduced the intracellular aggregation by various 
cataract-causing mutant crystallin proteins in a concentration-dependent 
manner. n = 3; P<1X10 “. Cholesterol did not reduce intracellular 
aggregation. n = 3; P> 0.1. g, Lanosterol increased the soluble fractions of 
various crystallin mutants in human lens progenitor cells. n = 3; P< 0.001. 
h, Effects of DMSO, cholesterol or lanosterol on «A-crystallin(Y118D) 
aggregates in human lens progenitor cells by serial live-cell imaging. 
Progression of crystallin aggregation dissolution by lanosterol can be observed, 
as evidenced by decreased green fluorescence following the time-course. 

i, Effect of lanosterol on dissolution of intracellular crystallin aggregates over 
time. n = 22 from three biological replicates. The 22 repetitions are shown in 
open circles distinguished by different colours. The mean = s.d. values are 
shown as filled black circles and error bars. The data are best fitted by the 
single exponential decay process (red line). Scale bars, 10 um. 


stained TEM photographs and the decrease in ThT fluorescence 
intensity (Fig. 4 and Extended Data Fig. 6d). As an example, the 
re-dissolved «A-crystallins could be identified in negatively stained 
TEM pictures and were around 15 nm in size (Fig. 4a)". 

To assess the effect of lanosterol on cataract reduction in lens tissues, 
we isolated naturally occurring cataractous lenses from rabbits. We 
incubated these cataractous lenses in a 25mM lanosterol solution 
for 6days and compared lens clarity before and after treatment of 
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Figure 4 | Lanosterol re-dissolved pre-formed amyloid-like fibrils of 
crystallin proteins. a, Negatively stained TEM photographs of aggregates of 
aA-crystallin mutant proteins treated by a liposome vehicle, cholesterol or 
lanosterol in liposomes. Images in the right column of the lanosterol group 
show a 5X magnification of the image on their left. b, Effect of lanosterol on the 
re-dissolution of crystallin aggregates by ThT fluorescence (n = 3). Left, 
B/y-crystallin mutants; right, o-crystallin mutants. Each bar results from three 
independent samples. 


lanosterol. We observed a strong trend of reduction in cataract sever- 
ity, as demonstrated by an increase in lens clarity (P < 0.003, Wilcoxon 
Test, Fig. 5a, b, Extended Data Table 2a and Extended Data Fig. 7a, b). 
We further investigated the effect of lanosterol in reversing cataracts in 
dogs in vivo. Lanosterol treatment significantly reduced cataract sever- 
ity and increased lens clarity (P< 0.009, Wilcoxon Test, Fig. 5c, d; 
Extended Data Table 2b and Extended Data Fig. 7c). 

In this study, we demonstrated that homozygous mutations affect- 
ing the catalytic function of LSS cause extensive congenital cataracts 
with severe vision loss. The critical role of lanosterol in cataract 
prevention is supported by the observation that a rat strain harbouring 
compound LSS mutations recapitulates the human cataract disease 
phenotype®. Consistent with this notion, inhibition of LSS by 
U18666A, an LSS inhibitor (also known as an oxidosqualene cyclase 
inhibitor), was found to cause cataracts'®. Furthermore, lanosterol 
treatment both decreased protein aggregation caused by mutant crys- 
tallin proteins in cell culture and reduced preformed cataract severity 
by increasing lens clarity in animal models. It is conceivable that the 
amphipathic nature of lanosterol allows it to intercalate into and coat 
hydrophobic core areas of large protein aggregates, effectively allowing 
these aggregations to gradually become water soluble again. 

In summary, we show that lanosterol plays a key role in inhibiting 
lens protein aggregation and reducing cataract formation, suggesting a 
novel strategy for the prevention and treatment of cataracts. Cataracts 
are the leading cause of blindness and millions of patients every year 
undergo cataract surgery to remove the opacified lenses. The surgery, 
although very successful, is nonetheless associated with complications 
and morbidities. Therefore, pharmacological treatment to reverse cat- 
aracts could have large health and economic impacts. In addition, 
our results may have broader implications for the treatment of 
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Figure 5 | Lanosterol reduced cataract severity and increased clarity. 

a, Photographs of a cataractous rabbit lens treated with lanosterol showing 
increased lens clarity. Left, before treatment; right, after. b, Boxplot of the 
quantification of the treatment effect of lanosterol ( = 13). c, Photographs of 
a cataractous dog lens treated with lanosterol showing increased lens clarity. 
Left, before treatment; right, after. d, Boxplot of the quantification of the 
treatment effect of lanosterol (n = 7). Range, median (horizontal line) and 
mean (circle) are presented. Crosses indicate the maximum and minimum 
cataract grades measured. Whiskers indicate the standard deviation and the 
box encompasses a 40% confidence interval. 


protein-aggregation diseases, including neurodegenerative diseases 
and diabetes, which collectively are a significant cause of morbidity 
and mortality in the elderly population, by encouraging the investiga- 
tion of small-molecule approaches, such as the one demonstrated here. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


Study participants. This study was approved by the Institutional Review Boards 
of Zhongshan Ophthalmic Center of Sun Yat-sen University, Sichuan University 
and the University of California, San Diego. Informed consent was obtained from 
all subjects before participation in the study. All participants underwent standard 
complete ophthalmic examinations and imaging studies. Demographic data, risk 
factors, and blood samples were collected at the initial visit. We recruited a con- 
sanguineous family consisting of two adults and four children. The parents were 
first cousins, and three of their four children were diagnosed with cataracts 
(Fig. 1a). We screened for LSS mutations in an additional 154 congenital cataract 
pedigrees and identified another family with a homozygous W581R mutation. 
Exome capture and sequencing. Exome capture of phase I data (mother and 
three affected children) and phase II data (father and one unaffected daughter) 
were hybridized with NimbleGen 2.1M probe and Agilent SureSelect Human All 
Exon V2 according to the manufacturer’s protocols, respectively. In brief, genomic 
DNA samples were randomly fragmented by Covaris with a base-pair peak of 
~150-200 bp for the resulting fragments, and adapters were then ligated to both 
ends of the fragments. The adaptor-ligated templates were purified using 
Agencourt AMPure SPRI beads, and fragments with insert size ~250 bp were 
excised. Extracted DNA was amplified by ligation-mediated PCR, purified and 
hybridized to the SureSelect Biotinylated RNA Library (BAITS) for enrichment. 
Hybridized fragments bound to the strepavidin beads, whereas non-hybridized 
fragments were washed out after 24h. Captured ligation-mediated PCR 
products were subjected to an Agilent 2100 Bioanalyzer to estimate the magnitude 
of enrichment. Each captured library was then loaded on Illunima Genome 
Analyzer II platform (phase I) or Hiseq 2000 (phase II), and paired-end sequen- 
cing was performed with read lengths of 90 bp, which provided at least 50 
average coverage depth for each sample. Raw image files were processed by 
Illumina base-calling software for base calling with default parameters. 

Read mapping and variant detection. Sequence reads in each individual 
were aligned to the human reference genome (NCBI build 37, hg19) using 
BWA" (version 0.5.9-r16). BAM files created by BWA were then processed using 
the GATK" best practice pipeline using Genome Analysis ToolKit (version GATK 
2.8) for re-alignment and variation (SNV and indel) detection. Variations that 
passed VQSR filtering criteria were extracted for the subsequent analyses. 
Candidate causal variant identification. Variants were functionally annotated 
using ANNOVAR”. Missense, nonsense, and splicing mutations, which were 
likely to be deleterious when compared with synonymous and noncoding 
mutations, were extracted for analysis. Variants with a homozygous genotype in 
affected individuals and a heterozygous reference genotype in unaffected indivi- 
duals, and a minor allele frequency <0.5% in both dbSNP137 and 1000 Genome 
Project databases (CEU) were considered as putative causal variants. Then, SIFT, 
Polyphen2, Phylop and Mutationtaster were used for function prediction, and 
mutations predicted to be damaging by no less than two tools were selected. 
Finally, variants that were shared as homozygous mutations within three affected 
children, as a heterozygous mutation in unaffected parents, and as heterozygous or 
homozygous to the reference genotype in unaffected daughter, but were absent in 
the public databases, were then considered as candidate causal variants. 
Whole-genome genotyping. Whole-genome SNP genotyping was performed 
using Illumina HumanOmni 5Exome-4v1-1 array for all six family members. 
Mendelian error rate was calculated to check relative relationship in the family 
as part of quality control. Then, high-quality SNPs were selected for homozygosity 
mapping by HomozygosityMapper (http://www.homozygositymapper.org). 
HomozygosityMapper calculates the length of the homozygous block (in SNPs) 
at each marker for each sample. The values of the ‘cases’ are then added to get the 
‘homozygosity score’ for a marker. 

Mutation screening of LSS and FDFT1 genes. Sanger DNA sequencing was 
performed to validate the G588S mutation in LSS. The 22 exons of the LSS gene 
were amplified by PCR and sequenced on the Genetic Analyzer 3130 (Applied 
Biosystems). The primers used to amplify the exons in LSS are presented in 
Extended Data Table 3a. We screened for mutations in the LSS gene in 154 families 
with congenital cataracts and identified another homozygous mutation, W581R, 
in a second consanguineous family. These two mutations were absent in 11,000 
control chromosomes, including 2,000 chromosomes from an unaffected control 
population at the University of California, San Diego and the 1000 Genomes 
Project, and 8,000 chromosomes from an exome sequencing database at the 
University of Washington. 

Due to a previous report that a FDFT1 mutation modifies cataract phenotypes, 
we screened variants in the FDFT1 gene, identifying only one common non- 
synonymous variant rs4731 (GRch37/hg19: chr8:11666337; NM_001287742.1: 
c.134A > G, NM_001274671.1:p.K45R). The variant was excluded as the causal 
mutation since an unaffected daughter harboured the same homozygous change, 


and a relatively high frequency of general population possess this variant (minor 
allele frequency >4% in 1000 Genome Project data) (Extended Data Table le). 
3D modelling of the G588S mutation. The model of the G588S mutant was built 
from two structures as determined by Ruf et al.”° and deposited in the Protein Data 
Bankas entries 1W6K and 1W6J’’. The X-ray coordinates were used to build a full- 
atom model of the enzyme, and it was refined using the Internal Coordinate 
Mechanics program (ICM) and its PDB conversion protocol’'. To analyse the 
effect of the G588S-mutation-induced clash on lanosterol binding, we analysed 
all side chains involved in the pocket of the enzyme interacting with lanosterol 
using the 1W6K structure. The areas of contact were calculated as the differences 
between the solvent-accessible area of each residue with and without lanosterol 
and were sorted by size using the ICM program”. 

Plasmid constructs and site-directed mutagenesis. The clone containing LSS 
cDNA was purchased from Thermo Scientific Inc. The coding sequence of wild- 
type LSS was cloned and inserted into the pcDNA3.1-N-Flag plasmid (Invitrogen). 
The mutants were constructed via site-directed mutagenesis by overlap extension 
using PCR. The common PCR primers were: NdeI forward, 5'-CATATGACGG 
AGGGCACGTGTCT-3’ and Xhol reverse, 5’-CTCGAGTCAGGGGTGGCCA 
GCAAG-3". The primers for constructing the W581R and G588S mutants were: 
W581R_ forward, 5’-TGGGAAGGCTCCCGGGGAGTTTGCT-3’; _ reverse, 
5'-GIGAAGCAAACTCCCCGGGAGCCTIC-3’;,  G588S__— forward, 5’- 
GCTTCACCTACAGCACCTGGTTTG-3’; G588S reverse, 5’-CCAAACC 
AGGTGCIGTAGGTGAAG-3’. The recombinant pcDNA3.1-N-Flag plasmids 
containing the wild-type or mutated LSS genes were transformed into E. coli 
DH5« cells. The cDNA of «A-, «B-, BB2-, yC- and yD-crystallin were cloned 
from the total cDNA of human lens as described previously *°. The mutants 
were constructed by site-directed mutagenesis using the primers listed in 
Extended Data Table 3b. The amplified fragments were digested by XhoI and 
BamHI, and then inserted into the eukaryotic expression vector peGFP-N1 or the 
prokaryotic expression vector pET28a. The plasmids were obtained using the 
Plasmid Maxiprep kit (Vigorous) and verified by DNA sequencing. Crystallin 
gene constructs were made as a C-terminus eGFP fusion protein, while LSS was 
made as an N-terminal Flag-tagged protein. 

Cell culture and transfection. HeLa cells and human lens epithelial B-3 cells 
(HLEB-3) were obtained from ATCC. Human lens progenitor cells were isolated 
froma fetal human eye”’. The HeLa cells were cultured in DMEM medium contain- 
ing 10% FBS (Gibco). The HLEB-3 cells were cultured in F12 medium with 20% 
FBS, while human lens progenitor cells were cultured in MEM medium containing 
20% FBS and 10 pg ml | FGF (Gibco). All cells were cultured at 37 °C in 5% CO, 
incubator. Cells routinely tested negative for mycoplasma contamination. 

To assess the effect of LSS expression on sterol content, HeLa cells were trans- 
fected with wild-type LSS or LSS mutants fused with a Flag tag at the N-terminus of 
the coding region. The cells were harvested after 24h transfection and the lipid 
fraction was extracted for LC-MS analysis. Cells transfected with the vector 
pcDNA3.1-N-Flag plasmids were used as a control. The expression levels of the 
wild-type and mutant LSS were normalized by western blot analysis using mouse 
anti-Flag (F1804; Sigma-Aldrich) and mouse anti-actin antibodies (BS6007M; 
Bioworld Technology). 

To assess the effect of lanosterol on crystallin aggregation, human lens progen- 
itor cells were co-transfected with LSS and various crystallin constructs for 4 h. Cells 
co-transfected with crystallin mutants and pcDNA3.1-N-Flag were used as a con- 
trol. Human lens progenitor cells co-transfected with LSS and crystallin mutant 
constructs were cultured for 12 h before assaying for aggregates. The rescue experi- 
ments were performed after 16h by addition of 40 1M sterols (lanosterol or cho- 
lesterol, Sigma-Aldrich) to the cell culture medium for 2 h, which was then replaced 
with fresh culture medium and cells cultured for 24h. The percentage of cells with 
crystallin aggregates was calculated from ten randomly selected viewing fields. The 
values of the wild-type LSS group, mutant group, and mutant plus lanosterol group 
were calculated. Cells treated with 1% DMSO were used as the controls. 

The impact of LSS and lanosterol on intracellular crystallin aggregation were 

evaluated in single-blinded observer studies. Experiments have been repeated at 
least three times. P values were calculated using Student's -tests. 
Fluorescence microscopy. Equal amounts of the human lens progenitor cells, 
HLEB-3 cells or HeLa cells were seeded on glass coverslips pretreated with TC 
(Solarbio). After culturing for 24h to reach 90% confluency, the cells were trans- 
fected with plasmids containing various LSS or crystallin genes or co-transfected 
with plasmids containing a certain crystallin gene and those containing the wild- 
type or mutated LSS gene. The controls were cells transfected with the 
plasmids containing peGFP-N1 and/or peDNA3.1-N-Flag. Both transfection 
and co-transfection were performed using Lipofectamine 3000 (Invitrogen) 
according to the instructions from the manufacturer. 

The effect of wild-type or mutated LSS on the intracellular aggregation of 
various cataract-causing crystallin mutants was evaluated by co-expression of 
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Flag-LSS and crystalline-GFP in the human lens progenitor cells, HLEB-3 cells or 
HeLa cells. The intracellular distributions of the proteins were visualized using 
GFP or antibody against Flag. After co-transfection for 4h, the cells were cultured 
in fresh media for 24h, and then analysed by microscopy. 

The effect of lanosterol or cholesterol on the aggresome formation of various 
crystallins was studied by transfecting the cells with plasmids containing various 
crystallin genes. The cells were incubated for 24h to enable efficient protein 
expression and aggresome formation. The cells were then treated with 0-40 1M 
sterols in 1% (for human lens progenitor cells) or 2% DMSO (for HeLa cells). Cells 
treated with 1% or 2% DMSO were used as the control. After treatment for 2 h, the 
media was replaced with fresh media. After 12h, the cells were used for micro- 
scopy analysis. 

The microscopy samples were prepared by washing the slips with phosphate 
buffered saline (PBS) three times. The cells were fixed with 4% paraformaldehyde 
for 40 min followed by another three washes with PBS. The cells were permeabi- 
lized with 0.1% Triton X-100 (Sigma) in PBS for 10 min and blocked with 5% 
normal goat serum in PBS for 1h at 37 °C. Immunostaining was carried out by 
adding mouse anti-Flag antibody (1:500) or mouse anti-p62 antibody (1:200, 
ab56416; Abcam) in PBS buffer containing 5% normal goat serum and incubated 
for 1h at 37°C. Then the slips were washed three times with PBS, and further 
incubated with Alexa 649-conjugated goat anti-mouse IgG (1:250) for 1h at 
ambient temperature. The nuclei were counterstained with Hoechst 33342 
(Invitrogen). The mounted cells were analysed using a Carl Zeiss LSM 710 con- 
focal microscope. 

Live-cell imaging. Human lens progenitor cells were transfected with plasmids 
containing «A-crystallin(Y118D) mutant. After a 24h transfection period, the 
cells with stable expression of «A-crystallin(Y118D) mutant were screen by 
incubation in culture medium containing 0.8 pg ml’ G418 for 7 days. Then the 
obtained cells were seeded onto glass bottom cell culture dishes (In Vitro 
Scientific) and treated with 1% DMSO, 40M cholesterol in 1% DMSO or 
40 uM lanosterol in 1% DMSO for 4h. Fresh culture medium was added, and 
the cells were analysed by serial live-cell imaging. Live-cell images were viewed 
with an Olympus IX81 microscope and captured with CellSens Dimension soft- 
ware (Olympus). Quantitative analysis of the size of aggregates was performed by 
measuring the fluorescence intensity of p62-positive aggregates using single-par- 
ticle tracking in live-cell imaging. The live-cell imaging was conducted using three 
biological replicates with 1-8 repetitions each. 

Lipid extraction of the cells. Extraction of lipids was performed using the Bligh 
and Dyer method”. In brief, ~1 X 10°-10’ HeLa cells were washed 3-5 times with 
PBS and then scraped in 400-1 ice-cold methanol and transferred to a 1.5 ml 
Eppendorf tube with the addition of 200 1l chloroform. The samples were vortex- 
agitated for 1 min and then mixed with 300 ul of 1M KCl. The organic and 
aqueous phases were separated by microcentrifugation at 20,817. for 5 min at 
4°C. After separation, the lower organic phase was collected. Then the residual 
aqueous phase was re-extracted twice using 300 ul chloroform. The collected 
organic phases were dried using a SpeedVac sample concentrator under vacuum. 
The dried samples were stored at — 80°C for further LC-MS analysis. 

LC-MS analysis. The dried lipid extracts were re-suspended in 100 jl methanol. 
The samples were vortex-agitated for 10 min, treated by 80 W ultrasonic sonica- 
tion for 30 min, microcentrifuged at 20,817g for 10 min, and then the supernatant 
was transferred to a new Eppendorf tube. The microcentrifugation treatment was 
repeated three times. The derived samples were analysed by an Agilent 1290/6460 
triple quadrupole LC/MS using an alternative Atmospheric Pressure Chemical 
Ionisation (APCI) source. The lipids were separated using an Agilent SB-C18 
column. Selective ion monitoring was performed using the electron ionization 
mode. The highly pure lanosterol and cholesterol were used as controls. The 
MS determination was performed using a gas temperature of 350°C, a gas flow 
rate of 4] min” ', a nebulizer of 60 p.s.i., a vaporizer of 350 °C, a capillary of 3,500 V 
and a corona current of 411A. To optimize the sensitivity and specificity, two 
qualifier ions were selected for the MS analysis of each compound (369.3/161.1 
and 369.3/147 for cholesterol, and 409.2/191.3 and 409.2/109 for lanosterol). 
Western blotting. The cell lysates were prepared in RIPA buffer containing 
50 mM Tris (pH 8.0), 150 mM NaCl, 1% Triton X-100, 1 mM EDTA, 0.5% sodium 
deoxycholate and 0.1% SDS. The supernatant and precipitation fractions were 
separated by centrifugation. The proteins were separated by a 12.5% SDS- 
PAGE and transferred to a PVDF membrane (GE Healthcare). The mouse anti- 
bodies against Flag (F1804; Sigma-Aldrich) or GFP (MB2005; Bioworld 
Technology) were used to identify the overexpressed LSS and crystallin proteins, 
respectively. Quantification of the western blot bands was achieved using the 
software GELPRO (Media Cybernetics). The presented quantitative data were 
calculated from three independent experiments. 

Protein expression and purification. The recombinant His-tagged wild-type and 
mutated B- and y-crystallin proteins were overexpressed in Escherichia coli 
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Rosetta and purified using an Ni-NTA affinity column followed by gel filtration 
chromatography using the same protocol as described elsewhere****”°”*. The over- 
expression and purification of the non-tagged «A- and oB-crystallins were per- 
formed as described previously*. The purity of the proteins was estimated to be 
above 95% as evaluated by one homogeneous band on 12.5% SDS-PAGE, 10% 
native-PAGE and a single peak in the size-exclusion chromatography profile. The 
protein concentration was determined according to the Bradford method by using 
BSA as the standard*’. All protein samples were prepared in 20 mM PBS buffer 
containing 150 mM NaCl, 1 mM EDTA and 1 mM DTT. 

Protein aggregation and aggregate dissociation. The aggregates of the wild-type 
and mutated #A- and «B-crystallin proteins were obtained by heating the protein 
solutions containing 1 M guanidine chloride (ultrapure, Sigma-Aldrich) at a con- 
centration of 5mgml' at 60°C for 2h. The aggregates of the wild-type and 
mutated B- and y-crystallins were prepared by heating the protein solutions con- 
taining 1 M guanidine chloride at 37 °C for 48 h. The formation of aggregates was 
confirmed by ThT fluorescence, turbidity (absorbance at 400 nm) and transmis- 
sion electron microscopy (TEM) observations. The preformed aggregates were 
re-suspended in 20 mM PBS with a final concentration of 0.2 mg ml * (approxi- 
mately 10 1M). The re-suspended aggregates were treated with 500 tM lanosterol 
or cholesterol in liposomes formed by 500 uM DPPC (Sigma-Aldrich) at 37 °C. 
Aggregates treated by 500 1M DPPC liposome were used as a negative control. 
After 24h of treatment, the protein solutions were used for ThT fluorescence, 
turbidity and negatively stained TEM observations. The TEM samples were pre- 
pared by depositing the protein solutions onto a freshly glow-discharged carbon- 
coated copper grid. Negative-staining samples were obtained by staining the grid 
with 1.25% uranyl acetate for 30s. The negatively stained TEM pictures were 
obtained on a Hitachi H-7650B transmission electron microscope with a voltage 
of 120kV and a magnification of 48,000. 

Treatment of cataractous rabbit lenses. This study was approved by IACUC of 
Zhongshan Ophthalmic Center and West China Hospital. Rabbits were eutha- 
nized by CO, inhalation and lenses were immediately dissected and treated with 
vehicle or lanosterol dissolved in vehicle to make 25 mM solutions. Lens tissues 
were incubated in these solutions for 6 days in the dark at room temperature. 
Cataracts were examined under a microscope and photographed. Degree of cat- 
aract was assessed by a blinded examiner using a previously described opacifica- 
tion grading system, shown below?**. Improvements in lens clarity and 
transparency were quantified by visual inspection and grading. Lens clarity was 
scored by transmission of light, clarity of a grid image underneath the lense 
(Extended Data Fig. 7), and improvement in overall clarity of a lens or improve- 
ment in clarity of localized areas of cortical cataract. Wilcoxon test was used to 
evaluate the treatment effect. 

Cataract grading system. 

* Grade 0: absence of opacification (gridlines clearly visible); 

* Grade 1: a slight degree of opacification (minimal clouding of gridlines, with 
gridlines still visible); 

* Grade 2: presence of diffuse opacification involving almost the entire lens (mod- 
erate clouding of gridlines, with main gridlines visible); 

* Grade 3: presence of extensive, thick opacification involving the entire lens (total 
clouding of gridlines, with gridlines not seen at all) 

Preparation of drug-loaded nanoparticles. Lanosterol was loaded into a lipid- 
polymer hybrid nanoparticle through an adapted nanoprecipitation method**. In 
brief, the desired concentration of lanosterol was mixed with polycaprolactone 
(PCL) polymer dissolved in acetonitrile. Lecithin and 1,2-distearoyl-sn-glycero-3- 
phosphoethanolamine-N-carboxy(polyethylene glycol) 2000 (DSPE-PEG-COOH) 
were dissolved in a 4% ethanol aqueous solution at 20% of the PCL polymer weight 
and heated above 60°C. The lanosterol/PCL solution was then added into the 
preheated lipid solution under gentle stirring followed by rigorous vortexing for 
3 min. The mixture solution was then stirred for 2h to allow the nanoparticles to 
form and the acetonitrile to evaporate. Next, the nanoparticle solution was washed 
three times using an Amicon Ultra-4 centrifugal filter (Millipore) with a molecular 
weight cut-off of 10 kDa to remove the remaining organic solvent and free mole- 
cules. The resulting nanoparticles were then re-suspended in PBS buffer for sub- 
sequent use. The size, size distribution, and surface zeta potential of the drug-loaded 
nanoparticles were characterized by dynamic light scattering. The loading yield of 
lanosterol was quantified by high-performance liquid chromatography. 
Treatment of cataractous lenses in dogs. This study was approved by IACUC of 
Zhongshan Ophthalmic Center and West China Hospital. The following adult dog 
breeds were used for assessing the treatment effect: black Labrador, Queensland 
Heeler, Miniature Pinscher. All dogs were adult, non-diabetic and had normal 
ocular surfaces and ocular adnexa, with naturally occurring adult onset cataracts. 
There were near equal distributions of male and female dogs. We screened all 
exons of the LSS gene in these dogs and did not find any mutations. To assess the 
effect of lanosterol treatment on cataracts in live animals, dogs were pre-medicated 
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with intramuscular injections of acepromaxine and butorphanol. After 20 min, 
induction of anaesthesia was performed by application of intravenous propofol. 
Dogs were then immediately intubated and maintained on oxygen and 2% iso- 
flurane at 21min~'. Lanosterol (100 p1g)-loaded nanoparticles were initially 
injected into the vitreous cavity in the test eye using a 28-gauge needle, and then 
were given every 3 days for the duration of the experiment. Treatment eyes or 
sham eyes were randomized. The control eye was given an injection with empty 
nanoparticle carriers as a negative control. The treatment eyes were treated with 
lanosterol in topical eye drops (see below for eye drop formulation). One 50-11 
drop of lanosterol was administered three times daily to the test eye over 6 weeks. 
Degree of cataract severity was examined by slit lamp and photographed at the 
beginning and the end of the 6-week treatment period. Prior to examinations, 
pupils were dilated with 1% tropicamide and 10% phenylephrine. Degree of cat- 
aract severity was assessed by a blinded examiner and scored based on canine 
cataract stage, shown below*’. Improvements in lens clarity and transparency were 
quantified. Wilcoxon test was used to evaluate the treatment effect. 

Grading system of canine cataracts. 

* Grade 0: absence of opacification (no cataract); 

* Grade 1: a slight degree of opacification (incipient stage); 

* Grade 2: presence of diffuse opacification involving almost the entire lens (imma- 
ture stage); 

* Grade 3: presence of extensive, thick opacification involving the entire lens 
(mature stage) 

Topical vehicle solution. Double distilled H,O was added to 1.1 g (EDTA),Na 
combined with 0.055g alkyldimethylbenzylammonium chloride until a final 
volume of 1.11 (pH 5.66) was achieved. 

25 mM lanosterol in the topical vehicle solution. Double distilled H,O was 
added to a mixture of 12.5 g lanosterol, 1.1 g (EDTA) Na, 0.055 g alkyldimethyl- 
benzylammonium chloride and 200 ml EtOH to a final volume of 1.11. 
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Extended Data Figure 1 | Genome-wide homozygosity. a, Homozygosity- the physical position on chromosome 21, which contains the LSS gene. Red bars 
Mapper plots the genome-wide homozygosity as bar charts. To emphasize indicate regions with highest scores. The right side of the chromosome contains 
regions of interest, any score higher than 80% of the maximum score reachedin _a long continuous homozygous region, where the LSS gene is located. 

this project is coloured in red. b, The homozygosity scores were plotted against 
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Extended Data Figure 2 | Representative confocal images of cells distribution of LSS was then visualized using an anti-Flag antibody (purple). 
co-transfected with Flag-LSS and eGFP. Human lens progenitor cells were The distribution of eGFP (green) was used as a control. The nuclei were stained 
co-transfected with either the wild-type or the mutated LSS gene and the eGFP and visualized by Hoechst 33342 (blue). 

gene for 4h and cultured for 16h in fresh culture medium. The cellular 
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Extended Data Figure 3 | Representative confocal images of cells co- 
transfected with LSS and various cataract-causing crystallin mutants. 

a, R116C mutant of «A-crystallin. b, R120G mutant of «B-crystallin. c, V187E 
mutant of BB2-crystallin. c, G129C mutant of yC-crystallin. e, W43R mutant 
of yD-crystallin. Human lens progenitor cells were co-transfected with either 
the wild-type or the mutated Flag—LSS gene and the mutant GFP-crystallin 
gene for 4h and cultured for 16h in fresh culture medium. All crystallin 
mutants formed p62-positive aggregates as indicated by the co-localization of 


Hoechst Merge 


the mutant crystallins and p62. Cells co-transfected with GFP-crystallin and 
pcDNA3.1-N-Flag were used as controls. The formation of intracellular 
aggregates of various crystallin proteins was visualized by fluorescence of GFP 
(green). Wild-type or mutated LSS was detected with an anti-Flag antibody 
(purple), p62 was stained using an anti-p62 antibody (red), while the nuclei 
were stained and visualized by Hoechst 33342 staining (blue). Quantitative 
analysis of cells with aggregates is summarized in Fig. 3c. 
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Extended Data Figure 4 | Inhibition of crystallin mutant aggregation by aggregates in mutant LSS or cholesterol groups showed no difference to the 
wild-type LSS and lanosterol in HLEB-3 cells (a) or HeLa cells (b). Cells control group (P > 0.1). ¢, Human lens progenitor cells were co-transfected 
co-transfected with LSS and crystallin mutant constructs were cultured for 24h with wild-type or mutant LSS plus «A-crystallin(Y118D). «A-crystallin(Y118D) 
before assaying for aggregates. The rescue experiments were performed by co-expressed with pcDNA3.1-N-Flag was used as a control. After transfection 
addition of 40 1M sterols (lanosterol or cholesterol) to the cell culture medium _ for 4h and incubation in fresh culture medium for another 24h, the cells 

for 2 h, the sterol medium was then replaced with fresh culture medium andthe were lysed and centrifuged to separate supernatant and insoluble fractions. LSS 
cells were cultured for a further 12h. The percentage of cells with crystallin and crystallin fusion proteins were detected by antibodies against Flag and GFP, 
aggregates were calculated from ten randomly selected viewing fields. The respectively. Red arrows indicate higher crystalline content in the soluble 
values of the wild-type LSS group, mutant group, or mutant plus lanosterol fraction versus in the insoluble fraction in cells containing the WT-LSS. 
group were calculated. Aggregates were significantly lower in the wild-type LSS | Data were quantified from three independent experiments and summarized 
and lanosterol groups compared to the control group (P< 1 X 107“), while in Fig. 3d. 
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Extended Data Figure 5 | Lanosterol significantly reduced the intracellular 
aggregation caused by various cataract-causing mutant crystallin proteins 
in a concentration-dependent manner when assayed in HLEB-3 or HeLa 
cells. a, Representative confocal images of HLEB-3 cells transfected with 
various cataract-causing crystallin mutants. b, Representative confocal images 
of HeLa cells transfected with various cataract-causing crystallin mutants. Cells 
were transfected with various crystallin constructs for 4h and cultured for 

an additional 24h in fresh culture medium. Then the cells were treated with 
10, 20 and 40 LM lanosterol in 1% (HLEB-3 cells) or 2% DMSO (HeLa cells) for 
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2h and cultured for another 12 h. Cells treated with 1% (HLEB-3 cells) or 2% 
DMSO (HeLa cells) were used as the controls. Formation of intracellular 
aggregates of various crystallin proteins was visualized by fluorescence of GFP 
(green) and the nuclei were stained with Hoechst 33342 (blue). Typical 
intracellular aggregates are indicated by arrows. c, Concentration dependence 
of the aggregation-dissolving effects of lanosterol when assayed in HLEB-3 
cells. d, Concentration dependence of the aggregation-dissolving effects of 
lanosterol when assayed in HeLa cells. 
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Extended Data Figure 6 | Treatment by lanosterol, but not cholesterol, 
increased cataract-causing mutant crystallins in soluble fractions when 
compared to a control group or a mutant LSS group. a, Human lens 
progenitor cells were transfected with mutant crystallin genes for 4h, and 
then incubated in fresh culture medium for another 24h. The cells were 
harvested and lysed. Supernatant and insoluble fractions were separated by 
centrifugation and analysed by western blot analysis. LSS and crystallin fusion 
proteins were identified by antibodies against Flag and GFP tags, respectively. 
The lanosterol-treated group is highlighted by red boxes. Cells treated with 
1% DMSO were used as a control. B-Actin was used as an internal protein 
loading control of total cell lysates (TCL). S, supernatant; P, insoluble fraction. 
b, Effect of DMSO (n = 4) and cholesterol (n = 7) on the size changes of 
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aA-crystallin(Y118D) aggregates in human lens progenitor cells evaluated by 
single-particle tracking in live-cell imaging. c, Evaluation of the effect of 
lanosterol on the dissolution of crystallin aggregates by turbidity. Crystallin 
aggregates were formed by incubating 5 mgml * protein solution at 60 °C 
for 2h (a-crystallins) or 37 °C for 48h (B- and y-crystallins) in the presence 
of 1 M guanidine chloride. The preformed aggregates were re-suspended in 
PBS at a final protein concentration of 0.2 mg ml ' and were treated with 
500 1M sterols in 500 UM DPPC liposome and incubated at 37 °C for 24h. 
Aggregates treated with 500 1M DPPC liposome only were used as the controls. 
d, Concentration-dependent effect of lanosterol on the re-dissolution of 
amyloid-like fibrils by «A-crystallin mutants evaluated by ThT fluorescence. 
Aggregates treated with 500 1M DPPC liposome only were used as the controls. 
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Extended Data Figure 7 | Grading system of cataractous lenses. a, Lenses 
were placed above a grid and photographed. The degree of transparency was 
scored as 0, a clear lens and absence of opacification (gridlines clearly visible, 
a’); 1, a blurry lens and a slight degree of opacification (minimal clouding of 
gridlines, with gridlines still visible, b’); 2, a cloudy lens and presence of 
diffuse opacification involving almost the entire lens (moderate clouding of 
gridlines, with main gridlines visible, c’); or 3, an opaque lens and presence of 
extensive thick opacification involving the entire lens (total clouding of 
gridlines, with gridlines not seen at all, d’). b, Lanosterol reduced cataract 
severity and increased clarity in isolated cataractous rabbit lenses. Rabbit lenses 
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(n = 13) were dissected and incubated with lanosterol for 6 days and 
subsequently assessed for lens clarity and transparency. Pairs of photographs of 
each cataractous rabbit lens showing before and after treatment with scores 
underneath are shown. c, Lanosterol reduced cataract severity and increased 
lens clarity in dogs. Dog eyes with cataracts (n = 7) were treated with 
lanosterol for 6 weeks and assessed for lens clarity and transparency. A pair 
of photographs of each study eye before and after treatment is shown with 
scores underneath. Three control eyes treated with vehicles alone are also 
presented. 
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Extended Data Table 1 


Exome sequencing and variants 


Total effective Average sequencing Coverage of target 


Fraction of target 


Fraction of target 


a Sample yield(Mb) depth ee region covered >= 4x covered >= 10x 
IV-1 3,409.20 60.16 0.20% 99.60% 99.10% 97.60% 
IV-2 3,314.58 58.62 0.20% 99.60% 99.20% 97.80% 
IV-3 3,327.63 57.24 0.20% 99.80% 99.20% 97.40% 
M-2 3,029.40 51.89 0.21% 99.80% 99.30% 97.70% 
I-41 6,877.08 54.24 0.29% 96.30% 89.40% 81.80% 
IV-4 6,331.78 44.12 0.29% 96.50% 88.80% 79.80% 
b Sample Total variation Heterzygotes Homozygotes missense nonsense _—readthrough +=synonymous splicing intergenic intronic 
IV-1 61,189 35,571 25,618 6,105 69 39 7,296 32 5,371 36,598 
IV-2 60,829 34,698 26,131 6,074 62 41 7,211 38 5,178 36,572 
IV-3 61,078 35,238 25,840 6,221 78 43 7,265 38 5,099 36,544 
Il-2 62,753 39,001 23,752 6,393 64 38 7,588 34 5,764 36,924 
Il-1 80,067 49,694 30,373 7,247 93 49 8,166 47 15,063 41,391 
IV-4 80,893 48,211 32,682 7,252 85 50 8,184 50 14,547 42,414 
c Filters Aes aa eee Ire Is eed Combine 
(carrier father) mother) daughter) (affected son) (affected son) daughter) 
Total variations 80,067 62,753 61,189 60,829 61,078 80,893 - 
Missense, Nonsense, Splicing 7,389 6,495 6,213 6,177 6,342 7,387 - 
pa ae oa ; 5,792 4,661 3,127 3,123 3,085 5,638 9 
Not in dbSNP 3,724 2,969 1,954 1,929 1,928 3,589 5 
Not in 1000 Genomes Project 1,032 767 227 264 245 1,059 1 
Predicted damaging 267 269 31 45 41 264 1 
‘Homozygous in affected child, heterozygous in carrier, no homozygous mutants in unaffected child 
d Sample Total loci Captured SNP 
IV-1 4,641,218 4,440,318 559,832 
IV-2 4,641,218 4,446,992 605,499 
IV-3 4,641,218 4,445,267 526,794 
Ml-2 4,641,218 4,448,054 537,925 
Ill-1 4,641,218 4,446,581 574,880 
IV-4 4,641,218 4,450,657 584,347 
Position ; Ml-4 II-2 IV-1 IV-2 IV-3 IV-4 
e (GRch37/hg19) refSNP REF ALT Function Seodlsed (carrier (affected (affected (affected (unaffected 
father) mother) daughter) son) son) daughter) 
chr8:11666337 1s4731 A G nonsynonymous AIG AIG G/G AIG G/G G/G 
chr8:11683653 rs904011 T Cc synonymous cic cic cic cic cic cic 


a, Summary of exome sequencing data production. b, Summary of detected variants. c, Variants prioritization pipeline after exome sequencing. d, Summary of whole-genome genotyping data. e, Coding variants 
detected on gene FDFT1. 
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Extended Data Table 2 | Treatment effect of lanosterol in cataractous rabbit lenses and dog cataracts. 
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Grading of the cataract severity was conducted by an examiner, blinded regarding the 
treatment status, based on a scale 0 to 3, as described in the Methods section. 


b Study eye 
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Grading of the cataract formation was conducted by an examiner, blinded regarding the 
treatment status, based on a scale 0 to 3. 0 = no cataract, 1 = incipient, 2 = immature, and 3 


= mature. 


a, Treatment effect of lanosterol in cataractous rabbit lenses. Grading of the cataract severity was conducted by an examiner, blinded to treatment status, on a scale from 0 to 3, as described in the Methods. 
b, Treatment effect of lanosterol in cataractous dog lenses. Grading of the cataract formation was conducted by an examiner, blinded to treatment status, on a scale from O to 3. 0 = no cataract, 1 = incipient, 


2 = immature, and 3 = mature. 
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Extended Data Table 3 | Primers used for sequencing of each exon in the human LSS gene and construction of crystallin mutants 


a Amplicon Sequence (5’-3’) 
LSS-Exon1-F GCCTGAGCGCCTGCCGAGGCCT 
LSS-Exon1-R GACACCTGAGGACCACCGGCCAT 
LSS-Exon2-F GTGGTCCTAGGTGCTGAGGAGA 
LSS-Exon2-R CGTGCTCCTCACGGCTCACCCCT 
LSS-Exon3-F CTTGGGCTGTATGTGAAGAGGGT 
LSS-Exon3-R CCTAGACCAGGCTGGGCCAGGAT 
LSS-Exon4-F GTTGGAGTGAGGTGCTCAGGAGGA 
LSS-Exon4-R GCAGCTGCCTGGAAACCCAAGCAT 
LSS-Exon5-F GCATTCTTAGTTTTCTGAGGAAACTC. 
LSS-Exon5-R CCACTGTTTCAGCTGCAAGTGCAT 
LSS-Exon6-F CAGAGGGTGAAGCTTCCCAGCT 
LSS-Exon6-R GCTGTCACAGCCTGCACCTGAG 
LSS-Exon7-F GAAAGGGCCCAAGGTATGGATGCT 
LSS-Exon7-R GTGAGTGGACAGGTGTGGTTAGAT 
LSS-Exon8-F GAGCCAGGCCTACCAGGTGCT 
LSS-Exon8-R GCAGGGGATGAGTGCGTGAAT 
LSS-Exon9-F GCAGTGCATGGAGCTCCAGGCT 
LSS-Exon9-R CCAGGAAACCCCACTCCCAGCT 

LSS-Exon10-F GTGGATCTGGACGAGACCTTGT 
LSS-Exon10-R CACTGGGATGCAGCTGGGGCT 
LSS-Exon11-F GTGCAGGGTCTGGGTAGCAGCT 
LSS-Exon11-R GACATGATTGCAAAGGAAGCAT 
LSS-Exon12-F CTGGAGGCAGTGGCTGGGAGT 
LSS-Exon12-R GCAAGTGTGTGGCCAGCAGTGCT 
LSS-Exon13-F GGCAGGATGTGGCCAGGACCAT 
LSS-Exon13-R GCACTTCTGCCTGCAGGAGCT 
LSS-Exon14-F CCAGTCTGTCTCAGCGATGT 
LSS-Exon14-R CCAAAAACGCCAAGGGAGGAGT 
LSS-Exon15-F CTGGCTGCACCCACACCTTTGGT 
LSS-Exon15-R GCTCATCTGCAGGACACGAGGT 
LSS-Exon16-F GTTGTCAGCCCTAGTGTTGCCT 
LSS-Exon16-R CAGGTTTGTGTACCACAGTGCT 
LSS-Exon17-F GAGCTGCAGAGCCTGGGCAGCCA 
LSS-Exon17-R CCGTGTCACAGAATGATGCGT 
LSS-Exon18-F GAATTGGGATAGGTAAACTGCT 
LSS-Exon18-R CGCAGTGTGTGAGAGCAGAAACCT 
LSS-Exon19-F CTTAATGCCTGAGGCACTGGAGT 
LSS-Exon19-R CACTCATGACAGAGCATTGGGTT 
LSS-Exon20-F CAAGGCAGCCTGCTGGGGTGA 
LSS-Exon20-R CACCGGCTCACAGCTGAGTGT 
LSS-Exon21-F CTCACTGCAGCATTCCAGGGTT 
LSS-Exon21-R GTGGAAACAGCCATGCACGCT 
LSS-Exon22-F GCCAACAGCCAGGGCTCCAGTT 
LSS-Exon22-R GGTTGGAGCCCAAGACAGGGT 

b Gene Primer (5'-3’) 

a@A-R116C-For TTCCCGTGAGTTCCACTGCCGCTACCGCCTGCCGTCGCTGC. 
aA-R116C-Rev CGGCAGGCGGTAGCGGCAGTGGAACTCACGGG 
a@A-R116H-For TTCCCGTGAGTTCCACCACCGCTACCGCCTGCCGTCGCCAC 
a@A-R116H-Rev CGGCAGGCGGTAGCGGTGGTGGAACTCACGGG 
aA-Y118D-For GAGTTCCACCGCCGCGACCGCCTGCCGTCCAACTTACGAC 
aA-Y118D-Rev CGTTGGACGGCAGGCGGTCGCGGCGGTGGAACT 
aB-R120G-For CAGGGAGTTCCACGGGAAATACCGGATAGGGGG 
aB-R120G-Rev GGATCCGGTATTTCCCGTGGAACTCCCT 
#B2N/187E-For AGGTGCAGTCCGAGCGCCGTATGTGGAG 
B2-N/187E-Rev ATACGGCGCTCGGACTGCACCT 
#B2-\N/187M-For AGGTGCAGTCCATGCGCCGTATGTGATG 
B2-V187M-Rev ATACGGCGCTCGGACTGCACCT 
/#B2-R188H-For TGCAGTCCGTGCACCGTATCCCGCCAC 
B2-R188H-Rev GGATACGGTGCACGGACTGCA 
yC-G129C-For CACGTGCTGGAGTGCTGCTGGGCTGC 
yC-G129C-Rev CAGCAGCACTCCAGCACGTG 
yD-W43R-For GTGGACAGCGGCTGCCGGATGCTCTATGAGCTGGCGG 
yD-W43R-Rev GCTCATAGAGCATCCGGCAGCCGCTGTCCAC 


a, Primers used for PCR amplification and sequencing of each exon in the human LSS gene. b, Primers used in construction of crystallin mutants. 
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T-cell exhaustion, co-stimulation and clinical 
outcome in autoimmunity and infection 


Eoin F. McKinney’, James C. Lee!”, David R. W. Jayne’, Paul A. Lyons? & Kenneth G. C. Smith!” 


The clinical course of autoimmune and infectious disease varies 
greatly, even between individuals with the same condition. An 
understanding of the molecular basis for this heterogeneity could 
lead to significant improvements in both monitoring and treat- 
ment. During chronic infection the process of T-cell exhaustion 
inhibits the immune response, facilitating viral persistence’. Here 
we show that a transcriptional signature reflecting CD8 T-cell 
exhaustion is associated with poor clearance of chronic viral infec- 
tion, but conversely predicts better prognosis in multiple auto- 
immune diseases. The development of CD8 T-cell exhaustion 
during chronic infection is driven both by persistence of antigen 
and by a lack of accessory ‘help’ signals. In autoimmunity, we find 
that where evidence of CD4 T-cell co-stimulation is pronounced, 
that of CD8 T-cell exhaustion is reduced. We can reproduce the 
exhaustion signature by modifying the balance of persistent stimu- 
lation of T-cell antigen receptors and specific CD2-induced co- 
stimulation provided to human CD8 T cells in vitro, suggesting 
that each process plays a role in dictating outcome in autoimmune 
disease. The ‘non-exhausted’ T-cell state driven by CD2-induced 
co-stimulation is reduced by signals through the exhaustion-assoc- 
iated inhibitory receptor PD-1, suggesting that induction of 
exhaustion may be a therapeutic strategy in autoimmune and 
inflammatory disease. Using expression of optimal surrogate mar- 
kers of co-stimulation/exhaustion signatures in independent data 
sets, we confirm an association with good clinical outcome or res- 
ponse to therapy in infection (hepatitis C virus) and vaccination 
(yellow fever, malaria, influenza), but poor outcome in autoimmune 
and inflammatory disease (type 1 diabetes, anti-neutrophil cytoplas- 
mic antibody-associated vasculitis, systemic lupus erythematosus, 
idiopathic pulmonary fibrosis and dengue haemorrhagic fever). 
Thus, T-cell exhaustion plays a central role in determining outcome 
in autoimmune disease and targeted manipulation of this process 
could lead to new therapeutic opportunities. 

In a complex set of data such as the transcriptome, similar measure- 
ments may be grouped together by network analysis to form discrete 
modules that can highlight novel pathways contributing to the patho- 
genesis of complex diseases. We have previously shown that a CD8 
T-cell transcriptional signature in patients with multiple immune- 
mediated diseases can predict a subsequent relapsing disease’. 
However, the biology underlying this observation was not clear. We 
therefore applied weighted gene co-expression network analysis 
(Extended Data Fig. 1) to the transcriptomes of purified CD4 and 
CD8 T cells isolated from a prospective cohort of 44 patients with 
anti-neutrophil cytoplasmic antibody-associated vasculitis (AAV) 
having active, untreated disease* (Supplementary Table 1) to further 
explore the mechanisms driving relapsing autoimmunity. Modules of 
genes (Fig. la, rows) were summarized as ‘eigengene’ profiles (Fig. 1b, 
f) that were correlated with clinical variables (Fig. 1a, i, columns) and 
visualized in the form of a heat map (Fig. 1a, i). Modules derived both 
from CD8 (Fig. la-d) and from CD4 (Fig. 1f-i) T-cell transcriptomes 
showed strong correlation with disease outcome but not activity, and 


were co-correlated (Fig. le) despite being mutually exclusive 
(Supplementary Table 2). A similar analysis using a cohort of 23 
patients with systemic lupus erythematosus (SLE) also presenting with 
active, untreated disease (Supplementary Table 3) identified analog- 
ous CD8 and CD4 T-cell expression modules (Extended Data Fig. 2) 
that again correlated with clinical outcome but not disease activity. By 
contrast a type 1 interferon (IFN) response signature was associated 
with disease activity but not with long-term outcome (Extended Data 
Fig. 2f), consistent with previous reports*. 

Next, we reasoned that genes within co-correlated modules in 
related cell types might inform the biology of relapsing disease. By 
selecting CD4 T-cell modules showing significant, strong correlation 
with relapse rate and performing network enrichment analysis, we 
identified a module corresponding to CD4 T-cell co-stimulation 
(Extended Data Figs 1f, g and 3a and Supplementary Tables 2 and 
3). By way of validation, we repeated this analysis using an independ- 
ent co-expression network algorithm that similarly demonstrated 
association between a CD4 co-stimulation module and clinical out- 
come (Supplementary Table 5). The independent association of modu- 
lar signatures with clinical outcome (Fig. la, i) was confirmed using 
multiple linear regression modelling (Extended Data Fig. 3b-e) and 
was only apparent during active disease (Extended Data Fig. 3f and 
Supplementary Discussion). 

During chronic viral infection, CD8 T-cell memory responses are 
exquisitely dependent on CD4 T-cell co-stimulation®*, which can lead 
to the resolution of chronic infection both in mice’ and in humans’. 
When antigen persists in the absence of co-stimulation, CD8 T cells 
become ‘exhausted’, a phenotype characterized by progressive loss of 
effector function, persistent high expression of inhibitory receptors 
and profound changes in gene expression, distinct from those seen 
in effector, memory or anergic T cells*. Although mice lacking inhib- 
itory receptors have an increased incidence and severity of autoimmu- 
nity”’®, a specific role for exhaustion in dictating the outcome of 
autoimmune responses has not been demonstrated. 

We hypothesized that CD4 T-cell signals may be important in lim- 
iting exhaustion towards persistent self-antigen during autoreactive 
immunity, analogous to responses during persistent infection. We 
therefore used gene set enrichment analysis (GSEA”’) to test for altered 
expression of transcriptional signatures reflecting T-cell exhaustion 
(and other T cell-related phenotypes) between subgroups of patients 
defined by the CD8 modular analysis, who go on to develop relapsing 
or quiescent autoimmunity (Fig. 2a). Using this approach, we observed 
that genes specifically downregulated in exhausted CD8 T cells during 
chronic murine lymphocytic choriomeningitis virus (LCMV) infec- 
tion (but not altered in memory, naive or effector cells; Supplementary 
Table 6 (ref. 8)) were similarly downregulated in CD8 T cells from 
patients at low risk of subsequent relapse (Fig. 2b and Extended Data 
Figs 3g-i and 4). 

During chronic murine LCMV infection, T-cell exhaustion is 
driven by coordinate upregulation of multiple co-inhibitory recep- 
tors’* that signal synergistically to produce a state of generalized 
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Figure 1 | Weighted gene co-expression network analysis of the T-cell 
transcriptome and its correlation with clinical phenotype in AAV. a, i, Heat 
maps illustrating the correlation of CD8 (a) and CD4 (i) co-expression 
modules (coloured blocks, y axis) with clinical traits in AAV (n = 44). 
Prognostic’ and random signature overlap with modules shown (a, right) 
(overlap = signature genes/module genes, as a percentage). b, f, Linear plots 
illustrating turquoise (b) and black (f) modules and summary eigengenes; 

y = expression (log,(ratio)), x = samples. c, d, g, h, Scatter plots showing 
normalized flare rate (c, g) and disease activity (d, h, Birmingham Vasculitis 
Activity Score (BVAS), y axis) against CD8 turquoise (c, d) or CD4 black 

(g, h) module eigengene expression (x axis). e, Scatter plot showing correlation 
between CD4 T-cell black (x axis) and CD8 T-cell turquoise (y axis) module 
eigengenes. Pearson correlation, r, with P = two-tailed significance. 


immunosuppression’*. In autoimmunity, these receptors were not 
coordinately upregulated as a group. Instead, patients with good pro- 
gnosis from each disease were characterized by upregulation of a dis- 
tinct subset of exhaustion-associated co-inhibitory receptors (Fig. 2c). 
Although a divergence from the murine LCMV model, T-cell exhaus- 
tion accompanied by expression of a limited subset of co-inhibitory 
receptors is similar to that described in intratumoural CD8 T cells’* 
which are a target for checkpoint therapy (Extended Data Fig. 4i)!>"°. 

To confirm whether exhaustion was associated with clinical out- 
come, we used the murine CD8 T-cell exhaustion signature 
(Supplementary Table 6 (ref. 8)) to perform unsupervised hierarchical 
clustering of three independent cohorts of patients with distinct dis- 
eases (AAV, Fig. 2d-f; SLE, Fig. 2g-i; inflammatory bowel disease 
(IBD), Fig. 2j-l). In each case this identified a subgroup of patients 
with both early (Fig. 2e, h, k) and recurrent (Fig. 2f, i, 1) relapses. 
Whereas CD8 exhaustion was associated with poor outcome in viral 
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Figure 2 | A gene expression signature of CD8 T-cell exhaustion predicts 
contrasting outcomes in infection and autoimmune disease. a, Heat map 
showing hierarchical clustering of patients with AAV (n = 44) by expression 
of the turquoise module (Fig. 1b) with corresponding flare rates (flares per 
number of days follow-up, y axis). b, Wind rose plot showing GSEA 
significance (increasing from centre, —logio(false discovery rate (FDR) q 
value)) of CD8 T-cell signatures tested between prognostic subgroups defined 
in a. c, Heat map showing differential expression of exhaustion-associated 
co-inhibitory receptors between prognostic subgroups identified in d, g, j. Blue, 
up; red, down in exhausted; grey, no change (FDR P< 0.05). d, g, j, Heat maps 
showing hierarchical clustering of CD8 T-cell expression data from patients 
with AAV (d, n = 58), SLE (g, n = 23) and IBD (j, n = 58) using a murine CD8 
exhaustion signature’. ‘Exhausted’ (blue) and ‘non-exhausted’ (red) subgroups 
of patients defined from the primary division of the cluster dendrogram. 

e, h, k, Kaplan-Meier curves showing censored flare-free survival and 

(f, i, 1) scatter plots showing flare rate normalized against duration of follow-up 
for subgroups of patients defined in d, g, j for AAV (e, f), SLE (h, i) and IBD 
(k, 1) cohorts. e, h, k, P = log-rank test. a, f, i, 1, P= Mann-Whitney test. 


infection, in every case it predicted favourable prognosis in auto- 
immune and inflammatory disease (Fig. 2d-l). Again, independent 
association with outcome was confirmed using multiple linear regres- 
sion models (Extended Data Fig. 3d, e). Together, these data dem- 
onstrate that a transcriptional signature of relative CD8 T-cell 
exhaustion, similar to that determining outcome in chronic viral infec- 
tion and cancer, is apparent during active, untreated disease in patients 
with favourable long-term outcome in multiple autoimmune and 
inflammatory diagnoses. 

CD8 T-cell exhaustion is characterized by high expression of co- 
inhibitory receptors (such as PD-1 (ref. 12)) and low expression of 
nascent memory markers (such as interleukin (IL)-7R’’) and is pro- 
moted both by the persistence of antigen’* and by a lack of accessory 
co-stimulation®. To understand signals driving exhaustion and out- 
come in autoimmunity, we attempted to recreate the outcome- 
associated transcriptional signatures using variable T-cell antigen 
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Figure 3 | T-cell co-stimulation with CD2 prevents development of an 
exhausted IL-7RloPD1hi phenotype. a, Schematic of the magnetic bead 
system providing variable TCR signal duration/co-stimulation during in vitro 
culture. b-f, Scatter plots illustrating IL-7R expression by cell division in 
unstimulated CD8 T cells (b) and following each of three different co-stimulation 
cultures (c-f), as indicated. g-i, Linear plots showing IL-7Rhi population 
resulting from (g) 36 h (black line) versus 6 d (blue line) anti-CD3/28 stimula- 
tion, (h) 6 d anti-CD2/3/28 (red line) versus 6 d anti-CD3/28 (blue line) and from 
6 d anti-CD2/3/28 with (green line) and without (i, red line) Fe-PDL1. Although 
duration of stimulation varied, all cells were analysed after 6 days of culture. 

j, Heat map showing unsupervised hierarchical clustering of murine CD8 T-cell 
gene expression data®* before (naive, grey), 8 days (effector, green) or 30 days 
(memory, red) after acute or more than 30 days (exhausted, blue) after chronic 
LCMV infection clustered by a CD2 response signature. k, 1, Scatter plots 
showing GSEA enrichment for genes upregulated (red) and downregulated 
(blue) by CD2 in (k) memory versus exhausted and (I) effector versus exhausted 
CD8 T cells. m-o, Heat map showing unsupervised hierarchical clustering of 
AAV (m, 1 = 58), SLE (n, n = 23) and IBD (0, n = 58) CD8 T-cell expression 
data using the CD2 response signature. ‘Exhausted’ (blue) and ‘non-exhausted’ 
(red) subgroups were defined from the major division of the cluster dendrogram. 
Upper bar indicates comparison with subgroups of patients produced using 
the murine LCMV exhaustion signature (as shown in Fig. 2d, g, j). Enrichment by 
GSEA of CD2 signature in autoimmune subgroups with FDR q< 0.1. 


receptor (TCR) signal duration and co-stimulation of primary human 
cells in vitro. We stimulated purified human CD8 T cells using a 
magnetic bead conjugated with antibodies targeting co-stimulatory 
molecules (Fig. 3a) and measured expression of IL-7R and PD-1 
expression (Fig. 3b-i and Extended Data Fig. 5a—g) as markers indi- 
cating an exhausted phenotype. Comparison between persistent (6 
days) and transient (36 h) TCR stimulation showed that IL-7R 
expression returned on a proportion of cells after several divisions 
when the TCR stimulus was removed (Fig. 3c) but failed to do so if 
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Figure 4| A surrogate marker of CD4 co-stimulation in PBMC gene 
expression data correlates with clinical outcome in chronic viral infection, 
vaccination, infection and autoimmunity. a, Scatter plot showing the top 100 
genes ranked by ability to identify CD4 T-cell co-stimulation subgroups in 
PBMC data; x axis, variable importance. b, Kaplan-Meier plots showing 
censored flare-free survival stratified by expression of KAT2B (red, above 
median; blue, below median) in patients with AAV and SLE (n = 37, training 
set) replicated on Affymetrix Gene 1.0 ST and in an independent cohort 

(test set, n = 47); P = log-rank test. c, Line and scatter plots showing serial 
KAT2B expression (n = 54) after therapy for chronic HCV infection giving a 
marked (red, n = 28) or poor response (blue, n = 26); P = two-way analysis 
of variance (ANOVA). d, Box plot showing post-vaccine malaria protection in 
a clinical trial (n = 43) stratified by KAT2B expression (red, above median; 
blue, below median); P = Fisher’s exact test. e, Box plot showing percentage 
protection (black) in vaccinees (n = 28) after seasonal influenza vaccine 
stratified by KAT2B expression; P = Fisher’s exact test. f, Scatter plot showing 
neutralizing antibody titre after YF-17D vaccination, stratified by KAT2B 
expression (f; red, above median KAT2B; blue, below median KAT2B); 

P= Mann-Whitney test. g, Line and scatter plot showing serial KAT2B 
expression throughout dengue infection (n = 78) stratified by progression to 
DHF (red, n = 24) or uncomplicated course (blue, uncomplicated dengue, 
n= 54). x axis, time (days) relative to defervescence. h, Box plot showing the 
percentage of patients with IPF (n = 75) progressing to transplantation/death 
(black) stratified by KAT2B expression (red, above median; blue, below 
median); P = Fisher’s exact test. i-k, Scatter plots showing serial KAT2B 
expression in healthy age-, sex- and HLA-matched controls (i, blue) and in pre- 
T1D cases (n = 5, red), two of which seroconvert to islet-cell antibodies (j, black 
line) and three of which develop T1D (k, black line). 


it persisted (Fig. 3d, g). We then systematically tested whether co- 
stimulatory molecules, identified from the CD4 T-cell network ana- 
lysis described above (Fig. 1i and Extended Data Figs 3a and 5h-k), 
could overcome the effect of persistent TCR stimulation during in vitro 
differentiation. We found that specific co-stimulation with anti-CD2 
(Fig. 3e, h), but not with other stimuli such as IFN-« or anti-CD40, 
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resulted in maintained IL-7R expression, limited upregulation of PD-1 
and enhanced cell survival (Fig. 3e and Extended Data Fig. 5). 

While CD8 exhaustion is known to limit viral control during 
chronic infection, exhausted cells may be restored to useful function 
by blocking inhibitory signalling through PD-1 (ref. 19). Enhancing 
co-inhibitory signals is therefore a logical therapeutic strategy in auto- 
immune disease, aiming to facilitate exhaustion despite high levels of 
co-stimulation that would otherwise be predicted to result in an 
aggressive relapsing disease course. To test this concept, primary 
human CD8 T cells were co-stimulated during persistent TCR signal- 
ling as above (Fig. 3e) in the presence or absence of a bead-bound 
Fc-chimaeric version of the principal PD-1 ligand, PDL-1 (Fig. 3a, f). 
When added to CD2-co-stimulated CD8 T-cell cultures, increased 
PD-1/PDL-1 signalling suppressed differentiation of a non-exhausted 
IL-7R™ subpopulation (Fig. 3f, i). 

To define the phenotype of T-cell exhaustion more robustly, as 
small numbers of surface markers are insufficient, we analysed the 
transcriptome of CD8 T cells exposed to persistent stimulation with 
and without CD2 signalling (Supplementary Table 7). This CD2 res- 
ponse signature characterized exhausted cells but not effector or mem- 
ory subsets (by GSEA; Fig. 3j-l). Consistent with this, patient clusters 
generated using the CD2 response signature recreated subgroups sim- 
ilar to those generated using the murine LCMV CD8 exhaustion sig- 
nature (Figs 2d, g, j and 3m-o). Thus, CD2 signalling during persistent 
TCR stimulation of primary human CD8 T cells prevents the develop- 
ment of transcriptional changes characteristic of exhaustion, recre- 
ating transcriptional signatures associated with opposing outcomes 
in viral infection and autoimmunity. 

To confirm that the transcriptional signatures reflected the develop- 
ment of functional exhaustion in vitro, we showed that cells appearing 
exhausted (IL-7R'°PD-1") also expressed markers of apoptotic 
resistance (BCL2"°), characteristic cytokine patterns (IEN-y'°IL-10") 
and showed diminished survival on re-stimulation (Extended Data 
Fig. 6a-e). There was no evidence of preferential accumulation of 
CD8 T-cell subsets after CD2-induced co-stimulation (Extended 
Data Fig. 6f-h). These data highlight the importance of CD2 signalling 
in limiting the development of CD8 T-cell exhaustion in the face of 
persistent TCR simulation, and provide a starting point for more 
sophisticated attempts to therapeutically exhaust an autoimmune 
response in a targeted fashion. 

We next aimed to independently validate the association between 
the balance of CD4 co-stimulation and CD8 exhaustion with clinical 
outcome using published data sets. Most of these profile unseparated 
peripheral blood mononuclear cells (PBMCs), in which T-cell- 
intrinsic signatures are not readily apparent owing to the confounding 
influence of expression from other cell types*’. We therefore used a 
classification algorithm (randomforests) to identify optimal surrogate 
markers of co-stimulation/exhaustion modules in PBMC from auto- 
immune patients taken concurrently with the T cells described above 
(Fig. 4a). As the CD8 exhaustion and CD4 co-stimulation signatures 
were themselves correlated (Extended Data Fig. 3g-i), it became easier 
to detect their combined signal in PBMC using surrogate markers 
(Fig. 4a and Extended Data Fig. 7). The top-ranked candidate, 
KAT2B, is a transcriptional co-activator known to mediate an anti- 
apoptotic effect under conditions of metabolic stress and to increase 
cellular resistance to cytotoxic compounds. These characteristics, 
along with its high expression in memory and T-follicular helper 
and natural killer (NK) cells (Extended Data Fig. 8), suggest that it 
may mark the development of a durable, persistent T-cell phenotype 
promoting long-lived responses in either infection or autoimmunity. 
The observed association was confirmed both by technical replication 
(using the same samples run on an independent array platform) and by 
independent validation (Fig. 4b). 

To test whether similar associations may be apparent in multiple 
infectious and autoimmune diseases, we directly compared expression 
levels of KAT2B (and of the other top surrogate markers; Extended 
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Data Fig. 9) between clinical subgroups defined within published stud- 
ies for which PBMC expression and linked clinical outcome data were 
available. Where subgroups were not pre-specified, we compared clin- 
ical outcome in groups stratified as having either above- or below- 
median expression of KA T2B (Fig. 4c-k). Hierarchical clustering using 
all top surrogate markers gave similar stratification to that seen using 
KAT2B alone, while, as expected, the separation of subgroups of 
patients varied slightly in different clinical circumstances (Fig. 4c-k 
and Extended Data Fig. 9). 

Combined IFN and ribavirin therapy may result in increased virus- 
specific T-cell responses in chronic hepatitis C virus (HCV) infection, 
although such eradication therapy is successful in only 50% of cases”! 
and in some no change in endogenous immune response is observed”. 
Ina cohort of patients with hepatitis C receiving combination therapy, 
KAT2B expression was progressively induced and showed significantly 
greater induction in those ultimately responding to therapy (Fig. 4c 
and Extended Data Fig. 10a). In a clinical trial of malaria vaccination”’, 
high KAT2B expression identified a subgroup with response rates of 
78%, almost twice that of the low KAT2B expression group (Fig. 4d and 
Extended Data Fig. 10b-d). Moreover, response to vaccination for 
either influenza** (Fig. 4e and Extended Data Fig. 10e-f) or yellow 
fever” (Fig. 4f) could be predicted by stratifying recipients on the basis 
of their expression of KAT2B after exposure to vaccine. Dengue viral 
infection can result in a wide range of clinical manifestations ranging 
from asymptomatic infection or self-limiting fever (uncomplicated 
dengue) to dengue haemorrhagic fever (DHF). Consistent with our 
observations in autoimmunity, we observed that KAT2B expression 
was elevated in patients developing the excessive inflammatory res- 
ponse of DHF (Fig. 4g)”°. 

We next asked whether surrogate detection of T-cell co-stimulation/ 
exhaustion modules could predict progression of other autoimmune 
diseases. Idiopathic pulmonary fibrosis (IPF) is a progressive intersti- 
tial lung disease characterized by both autoantibodies and autoreactive 
CD4 T cells’’. In a cohort of 75 patients with IPF’’, high expression of 
KAT2B predicted subsequent progression to transplantation or death 
(Fig. 4h). We also observed that PBMC Kat2b expression was elevated 
in the murine NOD model of type 1 diabetes (T1D)”, with levels rising 
sharply during the T-cell initiation phase, long before the onset of 
diabetic hyperglycaemia (Extended Data Fig. 10g). In a cohort of sam- 
ples taken prospectively from children at high risk of disease but before 
its onset*®, expression of KAT2B was seen to specifically and progres- 
sively rise (Fig. 4i-k) both in patients who progressed to T1D and in 
those who developed islet-cell autoantibodies. 

We show that the balance between co-stimulatory and co-inhibitory 
signals that shape T-cell exhaustion coincides with opposite clinical 
outcomes during autoreactive and anti-viral immunity. This at once 
allows prediction of outcome during infection and autoimmunity, and 
creates the potential for targeted therapeutic exhaustion of an auto- 
immune response in those predicted to follow an aggressive disease 
course. That this association is apparent in multiple autoimmune and 
inflammatory diseases emphasizes the importance of signals shaping 
T-cell exhaustion in driving risk of relapse or recurrence (prognosis) 
rather than disease susceptibility (diagnosis) or immediate severity 
(disease activity), and suggests that targeted manipulation of these 
processes may lead to new treatment strategies that extend beyond 
the conditions discussed here. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


No statistical methods were used to predetermine sample size. 

Ethical approval. Ethical approval for this study was obtained from the 
Cambridge Local Research Ethics Committee (reference numbers 04/023, 08/ 
H0306/21, 08/H0308/176) and informed consent was obtained from all subjects 
enrolled. 

Patients with AAV. Fifty-nine patients with AAV attending or referred to the 
specialist vasculitis unit at Addenbrooke’s Hospital, Cambridge, UK, between July 
2004 and May 2008 were enrolled into the present study. Active disease at pre- 
sentation was defined by the BVAS”' and the clinical impression that induction 
immunosuppression would be required. Prospective disease monitoring was 
undertaken monthly with serial BVAS disease scoring’! and full biochemical, 
haematological and immunological profiling followed by treatment with an 
immunosuppressant and tapering dose steroid therapy (Supplementary 
Table 1). At each time-point of follow-up, disease activity was allocated into one 
of three categories defined as follows: (1) flare (at least one major or three minor 
BVAS criteria); (2) low-grade activity (no major and one or two minor BVAS 
criteria); (3) no activity (no major or minor BVAS criteria). 

All disease flares were cross-checked against patient records to confirm clinical 

impression of disease activity and the need for intensified therapy as a result. 
Disease activity scoring was performed by a single investigator (E.F.M.) who 
was blinded to gene expression data at the time of scoring. Additional flares were 
defined in the absence of BVAS scoring if patients attended for emergency invest- 
igation (bronchoscopy, or specialist ophthalmological or ear/nose/throat surgical 
review) that confirmed evidence of active disease. To differentiate between discrete 
flares, clear improvement in disease activity was required in the form of an 
improvement in flare-related symptoms together with a reduction in BVAS score, 
a reduction in markers of inflammation (C-reactive protein, erythrocyte sedi- 
mentation rate) and a reduction in immunosuppressive therapy. 
Patients with SLE. The SLE cohort comprised 23 patients attending or referred to 
the Addenbrooke’s Hospital specialist vasculitis unit between July 2004 and May 
2008 who met at least four American College of Rheumatology SLE criteria’, 
presenting with active disease (defined below) and in whom immunosuppressive 
therapy was to be commenced or increased. After treatment with an immunosup- 
pressant, patients were followed up monthly. Disease monitoring was undertaken 
with serial British Isles Lupus Assessment Group (BILAG) disease scoring*’ and 
full biochemical, haematological and immunological profiling (Supplementary 
Table 4). 

A discrete disease flare required all three of the following prospectively defined 
criteria: (1) new BILAG score A or B in any system; (2) clinical impression of active 
disease by the reviewing physician; (3) the intention to increase immunosuppres- 
sive therapy as a result. 

Additional flares were defined in the absence of BILAG scoring if patients were 
admitted directly to hospital as emergency cases for increased immunosuppressive 
therapy. To differentiate between disease flares, clear improvement in disease 
activity was required in the form of diminished flare-related symptoms together 
with a reduction in both BILAG score and immunosuppressive therapy. 
Patients with IBD. Patients with active Crohn’s disease and ulcerative colitis were 
recruited from a specialist IBD clinic at Addenbrooke’s Hospital, before starting 
treatment. Diagnosis was made using standard endoscopic, histological and radio- 
logical criteria**. Patients who had already received immunomodulators or corti- 
costeroids were excluded. Enrolled patients were managed conventionally using a 
step-up strategy’. 

Assessment of disease activity was in accordance with national and inter- 
national guidelines and included consideration of symptoms, clinical signs and 
objective measures, including blood tests (C-reactive protein, erythrocyte sedi- 
mentation rate, haemoglobin concentration and serum albumin), stool markers 
(calprotectin) and mucosal assessment (by sigmoidoscopy or colonoscopy) where 
appropriate. Validated scoring tools were used as another means of assessing 
disease activity (Harvey-Bradshaw severity index* or simple clinical colitis activ- 
ity index”® for Crohn’s disease and ulcerative colitis, respectively), although these 
were not used to guide treatment decisions. All clinicians were blinded to the 
microarray results. 

For each disease, all patients were not included in all analyses as, for example, 
comparison of modular network analysis in related cell types required that sam- 
ples passing quality control filtering were available for all cell types for all patients. 
Our previous publications have shown that the sample sizes used here are adequate 
to detect reproducible signatures correlating with clinical traits. 

Follow-up analysis. Comparisons of outcome and associated clinical variables 
between subgroups were analysed using a Kaplan-Meier log-rank test and a non- 
parametric Mann-Whitney U-test or a 7° test as appropriate. Correction for 
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multiple testing was applied using the Bonferroni method or FDR (Benjamini 
and Hochberg method) where appropriate as indicated. 

Cell separation and RNA extraction. Venepuncture was performed at a similar 
time of day in all cases to minimize gene expression differences arising from 
circadian variation®”. PBMCs, CD4 and CD8 T cells were isolated from 110 ml 
of whole blood by centrifugation over ficoll and, for T cells, by positive selection 
using magnetic beads as previously described”. The purity of separated cell subsets 
was determined by flow cytometry and included as a covariate in downstream 
correlation and network analyses (for example, Fig. 1a, i). Total RNA was extracted 
from each cell population using an RNeasy mini kit (Qiagen) with quality assessed 
using an Agilent BioAnalyser 2100 and RNA quantification performed using a 
NanoDrop ND-1000 spectrophotometer. 

HsMediante 25k custom-spotted microarray. Total RNA (250 ng) was con- 
verted into double-stranded cDNA and labelled with Cy3- or Cy5-dCTP as prev- 
iously described”’. Appropriate Cy3- and Cy5-labelled samples were pooled and 
hybridized to custom-spotted oligonucleotide microarrays (HsMediante 25k) 
comprising probes representing 25,342 genes and control features**. All samples 
were hybridized in duplicate, using a dye-swap strategy, against a common ref- 
erence RNA derived from pooled PBMC samples. After hybridization, arrays were 
washed and scanned on an Agilent G2565B scanner. 

Affymetrix Human Gene ST microarray. Aliquots of total RNA (200 ng) were 
labelled using Ambion WT Sense Target labelling kit and hybridized to Human 
Gene 1.0 or 1.1 ST Arrays (Affymetrix) as described. After washing, arrays were 
scanned using a GS 3000 or Gene Titan scanner (Affymetrix) as appropriate. 
Published data sets. Published data sets were accessed through either National 
Center for Biotechnology Information (NCBI) Gene Expression Omnibus or 
ArrayExpress, imported into R using the Bioconductor package GEOquery and 
analysed as described. Search criteria incorporated the name of individual diseases 
and were filtered to human data sets but not by platform used. Studies were only 
included if they met the following criteria. (1) Similar quality control filters 
as applied to the data produced in-house were satisfied (described below). 
(2) Samples were taken at an analogous time-point to those from which the 
co-stimulation and exhaustion signatures in autoimmunity were identified; that 
is, samples taken during active disease without concurrent immunosuppressive 
therapy. (3) Clinical outcome data were available. 

It was not feasible to build a unified predictive model across all available data 
sets as they originated from different groups and were performed on mutually 
incompatible microarray platforms. 

For the HCV data used in Fig. 4c, a marked response was defined as an HCV 
titre decrease greater than 3.5 log;o(international units per millilitre (IU ml ')) 
and a poor response as an HCV titre decrease less than 1.5 logio(IU ml ') by day 
28 after commencing combined therapy with ribavirin and pegylated IFN-«. For 
the malaria vaccine trial used in Fig. 4d, ‘protection’ was defined as delayed or 
complete protection from subsequent confirmed Plasmodium falciparum infec- 
tion after standardized exposure (five bites) compared with infectivity control 
subjects. For the influenza data used in Fig. 4e, protection was defined as at least 
one high response to at least one (of three) included strains. A high response was 
defined as at least a fourfold increase in haemagglutination inhibition titre at day 
28 and a titre at least 1:40 as per US Food and Drug Administration guidelines. 

All gene expression data used have been deposited in publicly available repos- 

itories (National Center for Biotechnology Information (NCBI) Gene Expression 
Omnibus and ArrayExpress): AAV, SLE (E-MTAB-2452, E-MTAB-157, 
E-MTAB-145), IBD (E-MTAB-331), LCMV (GSE9650), HCV (GSE7123), 
malaria vaccination (GSE18323), influenza vaccination (GSE29619), yellow 
fever vaccination (GSE13486), dengue fever (GSE25001), IPF (GSE28221), T1D 
(E-TABM-666), NOD (GSE21897), rheumatoid arthritis (GSE15258, GSE33377), 
in vitro CD8 stimulation (E-MTAB-3470). 
Data preprocessing and quality control. For HsMediante 25k arrays, raw image 
data were extracted using Koadarray version 2.4 software (Koada Technology) and 
probes with a confidence score >0.3 in at least one channel were flagged as present. 
Extracted data were imported into R where log transformation and background 
subtraction were performed followed by within-array print-tip Loess normaliza- 
tion and between-array quantile and scale normalization using the Limma pack- 
age* in Bioconductor*’. Further analysis was then performed in R and only data 
demonstrating a strong negative correlation (7° > 0.9) between dye swap replicates 
were used in downstream analyses. 

Affymetrix raw data (.cel) files were imported into R and subjected to variance 
stabilization normalization using the VSN package in BioConductor*’. Quality 
control was performed using the Bioconductor package arrayQualityMetrics” 
with outlying samples removed from downstream analyses. Correction for batch 
variation was performed using the Bioconductor package ComBat*’ and batch 
structure was included as a covariate in downstream correlation analyses. 
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Clustering. Hierarchical clustering was performed using a Pearson correlation 
distance metric and average linkage analysis, performed either in Cluster with 
visualization in Treeview’, using Genepattern®, or directly in R using hclust in 
the statistics package. 
Differential expression. Differentially expressed genes were identified using lin- 
ear modelling and an empirical Bayes method” using an FDR threshold of 0.05 as 
indicated to determine significance. 
Weighted gene co-expression network analysis. Highly correlated genes in 
immune cell subsets were identified and summarized with a modular eigengene 
profile using the weighted gene co-expression network analysis (WGCNA) bio- 
conductor package in R“*. Normalized, log-transformed expression data were 
variance-filtered using the inflexion point of a ranked list of median absolute 
deviation values for all probes. A soft thresholding power was chosen on the basis 
of the criterion of approximate scale-free topology”. Gene networks were con- 
structed and modules identified from the resulting topological overlap matrix with 
a dissimilarity correlation threshold of 0.01 used to merge module boundaries and 
a specified minimum module size of n = 30. Modules were summarized as a 
network of modular eigengenes, which were then correlated with a matrix of 
clinical variables and the resulting correlation matrix visualized as a heat map 
(Extended Data Fig. 1). As each module by definition comprises highly correlated 
genes, their combined expression may be usefully summarized by eigengene pro- 
files**, effectively the first principal component of a given module (for example, 
Fig. 1b, f). A small number of eigengene profiles may therefore effectively ‘sum- 
marize’ the principle patterns within the cellular transcriptome with minimal loss 
of information. This dimensionality-reduction approach also facilitates correla- 
tion of modular eigengenes with clinical traits (for example, Fig. la, i). Significance 
of correlation between a given clinical trait and a modular eigengene was assessed 
using linear regression with Bonferroni adjustment to correct for multiple testing 
(Extended Data Fig. 1). Independent association of a given module eigengene or 
gene expression profile (for example, KAT2B) with clinical outcome was assessed 
using a multiple linear regression model. Significance of each term in the linear 
model was plotted against its regression coefficient, as a measure of the strength of 
association (the regression coefficient reflecting the change in clinical outcome per 
unit change in modular/gene expression), for example Extended Data Fig. 3b-e. 
Overlap of signatures with modules derived from network analysis is shown to 
the right of selected module heat maps (Fig. 1a and Extended Data Fig. 2a, e, f) by 
the following formula to allow correction for variable module size: [(signature 
genes overlapping with module genes, n)/(genes in module, n)] X 100. The overlap 
of randomly selected signatures of equivalent size was used as a control and is 
shown adjacent to the above plots. 
Hierarchical ordered partitioning and collapsing hybrid analysis. For valid- 
ation purposes, highly correlated genes were independently partitioned into dis- 
crete modules using a second algorithm, hierarchical ordered partitioning and 
collapsing hybrid (HOPACH”) in R. This approach differs from WGCNA in that 
it does not rely on a user-specified correlation threshold to define module bound- 
aries but rather aims to maximize homogeneity of modules. Normalized, log- 
transformed data were clustered using a hierarchical algorithm with modular 
boundaries defined by the median split silhouette, a measure of how well-matched 
a gene is to the other genes within its current cluster versus how well-matched it 
would be if it were moved to another cluster. On partitioning the data set into 
clusters, each cluster is reiteratively subdivided until the median split silhouette is 
maximized, thereby producing an optimal segregation into maximally discrete 
modules. 
Knowledge-based network generation and pathway analysis. The biological 
relevance of gene groups composing modules identified by co-expression analysis 
was further investigated using the Ingenuity Pathway Analysis platform™. Six 
modules from the CD4 T-cell WGCNA analysis showed significant correlation 
with clinical outcome in AAV after correction for multiple testing (Bonferroni 
method; Supplementary Table 3). We applied network and pathway enrichment 
analysis to genes composing these modules to determine whether they may have 
any biological relevance. Briefly, for network analysis, genes from a specified target 
set of interest are progressively linked together on the basis of a measure of their 
interconnection, which is derived from described functional interactions. 
Additional highly interconnected genes that are absent from the target genes (open 
symbols) may be added to complete a network of arbitrary size (set at n = 35). 
Networks may be ranked by significance which reflects the probability of ran- 
domly generating a network of similar size from genes included in the full know- 
ledge database containing at least as many target genes as in the network in 
question. For pathways analysis, the over-representation of canonical pathways 
(pre-defined, well-characterized metabolic and signalling pathways curated from 
extensive literature reviews) among a specified set of target genes is assessed, with 
significance determined by computing a Fisher’s exact test with FDR correction for 
multiple testing. 


GSEA. GSEA”' was used to further assess whether specific biological pathways or 
signatures were significantly enriched between subgroups of patients identified by 
gene modules (as opposed to testing for enrichment of pathways within modules 
themselves as outlined in the previous section). GSEA determines whether an a 
priori defined ‘set’ of genes (such as a signature) shows statistically significant 
cumulative changes in gene expression between phenotypic subgroups (such as 
patients with relapsing or quiescent disease). In brief, all genes are ranked on the 
basis of their differential expression between two groups then an enrichment score 
is calculated for a given gene set on the basis of how often its members appear at the 
top or bottom of the ranked differential list. One thousand random permutations 
of the phenotypic subgroups were used to establish a null distribution of enrich- 
ment score against which a normalized enrichment score and FDR-corrected q 
values were calculated. GSEA was run with a focused subgroup of gene signatures 
(as in Figs 2b and 3k)" selected to test the null hypothesis that different CD8 T-cell 
phenotypes were not significantly enriched in subgroups of patients identified by 
modular analysis. 

Selection of optimal PBMC-level biomarkers. Optimal surrogate markers facil- 
itating identification of the CD4 T-cell co-stimulation/CD8 exhaustion signatures 
in PBMC-level data were determined using a random forests classification algo- 
rithm*’ (Fig. 4a). Although signatures apparent in purified T-cell transcriptome 
data correlate with clinical outcome, they cannot be similarly detected in data 
derived from PBMC owing to the confounding influence of expression from other 
cell types; nor can the same genes be used to predict outcome in PBMC”. 
However, as CD4 T-cell co-stimulation and CD8 T-cell exhaustion signatures 
themselves showed close correlation, we hypothesized that this would amplify 
the signal detectable in PBMC and that detection of the combined CD4/CD8 
T-cell response might be feasible. The availability of both separated T-cell and 
PBMC data from the same patients at the same time facilitates a supervised search 
for surrogate markers of the co-stimulation/exhaustion signatures in PBMC. 
Expression data derived both from CD4 T cells and from PBMC were available 
for a cohort of n = 37 patients (AAV and SLE) after quality control and hybrid- 
ization to the HsMediante 25k custom microarray platform and constituted a 
training cohort. Normalized, log-transformed expression data were analysed using 
the MLInterfaces Bioconductor package in R*’. Using PBMC-level expression, 
data samples were classified into subgroups showing either high or low expression 
of the co-stimulation/exhaustion signature (as illustrated in Extended Data Fig. 5h, 
i) and probes were subsequently ranked using the variable importance metric on 
the basis of their ability to predict allocation to either group. The variable import- 
ance for a given gene reflects the change in accuracy of classification (percentage 
increase in MSE or increase in node purity) when that variable is randomly 
permuted. For a poorly predictive gene, random permutation of its values will 
minimally influence classification accuracy. Conversely, the most robust predic- 
tors will have a comparatively large effect on classification accuracy when ran- 
domly permuted. PBMC samples from a subset of n = 37 cases derived from the 
training cohort were labelled and hybridized on an alternative microarray plat- 
form (Affymetrix Gene 1.0 ST) as a technical validation set (Fig. 4b, left panel). 
PBMC samples from an independent cohort of n = 47 cases not included in the 
training cohort were labelled and hybridized to the Affymetrix Gene 1.0 ST plat- 
form as an independent test set (Fig. 4b, right panel). For both technical validation 
and independent test sets, expression of the optimal biomarker identified in Fig. 4a 
(KAT2B) was used to bisect the cohort relative to the median expression and 
clinical outcome was compared in patients with KAT2Bhi and KAT2Blo. 

Linear models. Linear modelling was performed in R using the statistics package. 
This took the form of 


fit< —Im(y~x14+x2+.x3, data=mydata), 


where y (the response variable) was selected as normalized flare rate (flares per 
number of days follow-up, and x1-xn (the test variables) were selected to include 
measures of disease activity (both clinical scores and laboratory markers of inflam- 
mation), quantification of circulating leucocyte subsets (lymphocytes, neutro- 
phils) and concurrent measurements of autoantibody titre where relevant. Test 
variables also included a biomarker profile (for example, exhaustion signature or 
KAT2B expression). The significance and magnitude (regression coefficient, 
reflecting change in response variable (flares per number of days follow-up) per 
unit change in each test variable included) were extracted and plotted against each 
other (for example, Extended Data Fig. 3b-e). Not all clinical or laboratory mea- 
sures were relevant comparisons in each case and therefore were not all included in 
every model generated. 

T-cell culture. Primary human CD8 T cells were separated from leucocyte cones 
obtained from NHS Blood and Transplant (Addenbrooke’s Hospital) by centrifu- 
gation over ficoll and positive selection using magnetic beads as previously 
described”. The purity of separated cell subsets was determined by three-colour 
flow cytometry. Purified T cells were labelled with 10 1M CFSE (Invitrogen) and 
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re-suspended in complete RPMI 1640 (Sigma Aldrich) in the presence of 10% 
FCS. Purified CD8+ T cells (>95%) were then stimulated in sterile, 96-well 
U-bottomed culture plates (Greiner) using an ‘artificial APC consisting of 
MACS iBead particles (1:2 bead:cell ratio, Miltenyi) or DynaBead particles 
(Invitrogen) conjugated to either CD3/CD28 or CD2/CD3/CD28 as indicated 
in the presence of IL-2 (10ngml~', Gibco Life Technologies) for 6 days. The 
magnetic iBead construct was removed after 36 h in some instances as indicated. 
In some experiments, additional co-stimulation was provided by the addition of 
either IFN-o. (10ngml~', Abcam) or by additional conjugation of recombinant 
Human PD-L1 Fc Chimera (Life Technologies, 1 1g ml *) or anti-CD40 antibody 
(50 ng ml’, Abcam) as indicated. The nature of co-stimulatory signals tested was 
based upon the results of the network analysis of CD4 T-cell modules described 
above (Supplementary Table 2). 

For re-stimulation experiments, cells were harvested on day 6 after stimulation 
and sorted into IL-7R" and IL-7R" populations (Extended Data Fig. 6d) using a 
FACSArialII cell sorter (BD Biosciences) with live/dead discrimination performed 
using an AquaFluorescent amine-reactive dye (Invitrogen). Cell numbers were 
normalized and were re-suspended in complete RPMI 1640 (2 X 10* per millilitre, 
Sigma-Aldrich) and ‘rested’ in a sterile, U-bottomed culture plate (Greiner) for 6 
days (37°C, 5% CO,) before being re-stimulated (anti-CD2/3/28 1:2 bead:cell 
ratio, Miltenyi MACSiBead) for a further 6 days in the presence of IL-2 (10ng 
ml !, Gibco Life Technologies). 

Note that, as described in Extended Data Fig. 6g, human memory CD8 T-cell 

subsets do not equivalently respond to the stimulation conditions described above. 
As primary whole human CD8 T cells are composed of highly variable proportions 
of memory subsets and whole CD8 T cells were stimulated, it was necessary to 
perform paired tests of significance when comparing resulting T-cell subsets and 
transcriptional profiles. 
Flow cytometry. Immunophenotyping was performed using an LSR Fortessa 
analyser (BD Biosciences), and data were analysed using FlowJo software (Tree 
Star). Reactions were standardized with multicolour calibration particles (BD 
Biosciences) with saturating concentrations of the following antibodies: 
AquaFluorescent Live/Dead (Invitrogen), IL-7Ra AF647 (BD biosciences, clone 
HIL-7R-M21), PDCD1 APC (eBioscience, clone MIH4). For intracellular staining, 
cells were fixed and permeabilized using a transcription factor staining buffer set 
(eBioscience) and before staining with saturating concentrations of antibody 
against BCL2 (BD Biosciences, clone 100). 
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Extended Data Figure 1 | Overview of weighted gene co-expression analysis. 
a, Messenger RNA derived from purified leucocyte subsets sampled during 
active, untreated autoimmune disease is labelled and hybridized to a microarray 
platform (both HsMediante 25k and Affymetrix Gene 1.0 ST used here). Genes 
are then combined into modules (b, coloured blocks) based on the similarity of 
their expression profile in all samples. c, Detail for the ‘black’ module. Each 
horizontal black line represents expression of a single gene within the given 
module; y axis, gene expression; x axis, patient samples; red bar, eigengene 
profile which effectively summarizes the expression of all genes constituting the 
black module. d, Each modular profile is related to all others in a hierarchy that 
can itself be visualized by plotting correlation of all module eigengenes, such as 
in the heat map shown here. Coloured blocks represent individual modules, 
defined as in a. Modules are aligned in identical order on x and y axes with heat- 
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map colour representing the correlation between each. Note that the diagonal 
(top left to bottom right) therefore represents the correlation of each eigengene 
profile with itself, and is always 1. Distance metric is the Euclidean distance. 
e, As each module is summarized by a representative eigengene profile, each 
may then be correlated against a range of clinical variables, allowing 
visualization of how the transcriptome relates to clinical variables, again in the 
form of a correlation heat map. Pearson correlation, r. f, Heat map showing 
gene expression modules (y axis) correlated against clinical variables (x axis) for 
the CD4 transcriptome in AAV. Pearson correlation, r. g, Heat map illustrating 
significance of correlations identified in f. P value threshold at Bonferroni- 
corrected P < 0.05. Colour bar indicates actual P value of correlations deemed 
significant; grey shading, corrected P > 0.05. Significance for co-stimulation 
(black) module from Fig. 1 is also shown (P = 0.0005). 
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Extended Data Figure 2 | Weighted gene co-expression network analysis of 
the T-cell transcriptome and its correlation with clinical phenotype in SLE. 
a, e, Heat maps illustrating the correlation of co-expression modules (coloured 
blocks, y axis) derived from the CD8 (a) and CD4 (e) transcriptomes of 23 SLE 
patients with clinical traits (x axis). Overlap of the previously described 
prognostic signature with co-expression modules, along with the distribution of 
a random signature of equivalent size, shown to the right of 

a (overlap = signature genes/module genes, as a percentage). Overlap of the 
CD4 T-cell co-stimulation ‘black’ module (defined in Fig. 1) shown to the right 
of e, with a randomly derived module and a type 1 IFN response signature 
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previously shown to associate with active SLE’. Overlap shown as percentage 
representation of the signature within each module. b, d, Linear plots 
illustrating the ‘charcoal’ (b) and ‘grey’ (d) modules in detail; y axis, gene 
expression; x axis, individual patients; coloured lines (red, blue), module 
eigengenes. c, Correlation of SLE CD4 T-cell co-stimulation module eigengene 
(x axis, blue) against SLE CD8 T-cell prognostic signature (y axis, red). Pearson 
correlation, r, with P = two-tailed significance. f, Expanded detail from 

e, illustrating that modules corresponding to type 1 IFN response and co- 
stimulation signatures correlate with disease activity and outcome respectively 
but not vice versa. 
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Extended Data Figure 3 | Identification and validation of genes involved in 
CD4 co-stimulation that correlate with clinical outcome, and how that 
relationship changes after treatment. a, A knowledge-based network analysis 
of 336 probes composing the ‘black’ expression module (Fig. le) identifies a 
network of co-stimulation signalling (Supplementary Table 3). Individual 
genes are shown in circles with the ‘strength’ of their connections indicated by 
the weight of the black bar linking them. Pathways of TCR signalling, inducible 
T-cell co-stimulator and its ligand (ICOS-ICOSL) signalling and CD28 
signalling are all significantly enriched in this module (FDR P< 0.05). 

b-e, Scatter plots showing the outcome of multiple linear regression models 
testing the association of four signatures (red symbols) as indicated, directly 
compared with clinical markers of disease activity (black symbols); x axis, 
magnitude of association (regression coefficient, change in normalized flare 
rate (flares per number of days follow-up) per unit change in each variable 
tested); y axis, significance of association in multiple regression model; P, 
significance threshold (dashed red line, P = 0.05). b, CD8 turquoise module 
eigengene in AAV, (c) CD4 co-stimulation (black) module eigengene in AAV, 
(d, e) CD8 exhaustion signature (Supplementary Table 6) in AAV/SLE (d) and 
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IBD (e). Clinical variables incorporated vary owing to differing relevance in 
each case but include some of the following: disease activity score (BVAS/ 
BILAG/CDAI/Harvey-Bradshaw score), C-reactive protein, autoantibody titre 
(PR3/MPO, dsDNA), lymphocyte count, neutrophil count, platelet count, IgG, 
IgA, IgM, erythrocyte sedimentation rate, age. f, Line plot showing mean 
expression of a CD8 T-cell exhaustion signature in 38 patients with AAV 
measured at presentation during active, untreated disease (fo) and 12 months 
later when disease activity was quiescent and patients were on maintenance 
immunosuppressive therapy (f,2). Patients are grouped into those falling above 
(red) and below (blue) median expression of the exhaustion signature 
eigengene at entry. P= Mann-Whitney test comparing f,2 and ty values. The 
difference between the groups that is easily apparent at enrolment with active, 
untreated disease (fo) is no longer apparent when disease is treated and 
quiescent 12 months later (t;2). g-i, Scatter plots showing inverse correlation 
between individual eigenvalues of the CD4 co-stimulation signature (x axis, 
red) and the CD8 exhaustion signature (y axis, blue) defined as in Fig. 2, for 
AAV (g), SLE (h) and IBD (i) cohorts. Pearson correlation, r’, two-tailed 
significance. 
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Extended Data Figure 4 | Wind rose plots showing relative GSEA 
enrichment of immune signatures in autoimmune disease and melanoma. 
Wind rose plots showing relative enrichment (GSEA FDR gq value) of distinct 
immune signatures between subgroups of patients (as defined as in Fig. 2). 

a, b, AAV; ¢, d, SLE; e, f, IBD. a, c, e, Enrichment of immune signatures from 
selected CD8 T-cell phenotypes; b, d, f, enrichment of signatures specifically 
up-/downregulated by CD8 T-cell subsets derived from the LCMV model of 
T-cell exhaustion (acute LCMV Armstrong versus chronic LCMV C113 (ref. 
8)). Detailed information on genes included in each signature is provided in 
Supplementary Table 6. g, h, Wind rose plots showing relative enrichment 
(GSEA FDR q value) of distinct immune signatures between CD8 T cells from 
patients with melanoma, comparing CD8 from tumour-infiltrated lymph node 
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with circulating CD8 T cells'®. g, Enrichment of immune signatures from 
selected CD8 T-cell phenotypes; h, enrichment of signatures specifically up-/ 
downregulated by CD8 T-cell subsets derived from the LCMV model of T-cell 
exhaustion (acute LCMV Armstrong versus chronic LCMV C113 (ref. 8)). 
Specific enrichment is seen for genes downregulated by exhausted cells but not 
for all genes upregulated by exhausted cells. c, Heat map showing differential 
expression of selected canonical co-inhibitory receptors (as for Fig. 2c (ref. 12)) 
in the LCMV exhaustion model, between prognostic subgroups identified in 
Fig. 2d, g, j and between exhausted CD8 T cells from melanoma-infiltrated 
lymph node compared with circulating tumour-specific CD8 T cells’®. Blue, up 
in exhausted; red, up in non-exhausted; grey, no significant change (FDR 
P<0.05). 
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Extended Data Figure 5 | T-cell co-stimulation with CD2, but not type 1 
IEN or anti-CD40, prevents development of an exhausted IL-7R'’°PD1™ 
phenotype during prolonged anti-CD3/28 T-cell stimulation. 

a-d, Representative scatter plots showing IL-7R expression (y axis) by cell 
division (CFSE dilution, x axis) in (a) unstimulated cells and following each of 
three different co-stimulation cultures: b, anti-CD3/CD28 alone; c, anti-CD2/ 
3/28; d, anti-CD40/3/28. IL-7R"-expressing subset indicated in black gate with 
the percentage of live cells shown. e-g, Line and scatter plots showing absolute 
number of IL-7R" cells (e), PD-1 expression (f) and cell death 

(g) (death = AquaFluorescent dye") during CD8 T-cell differentiation (x axis, 
number of divisions undergone by day 6 of culture measured by CFSE dilution) 
after anti-CD3/28 (blue) or anti-CD2/3/28 (red) stimulation. P = paired t-test, 
n= 5 paired samples. h, i, Hierarchical clustering of 44 patients with AAV (left 
panels) and 23 patients with SLE (right panels) using 336 genes composing a 
CD4 T-cell co-stimulation module (black module, Fig. 1) identifies two 
subgroups of patients (high co-stimulation, red; low co-stimulation, blue) in 
CD4 T-cell expression data defined by the first major division in the patient 
dendrogram. j, k, Scatter plots illustrating selected co-stimulatory and 
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co-inhibitory receptors for the subgroups identified in h and i. Selected 
receptors were chosen on the basis of their inclusion in networks derived from 
the co-stimulation and exhaustion signatures as illustrated in Extended Data 
Fig. 3a. 1, m, Line and scatter plots showing absolute number of IL-7R" cells 
(y axis) by number of divisions undergone at day 6 (x axis) after polyclonal 
stimulation with anti-CD3/28 (blue) or anti-CD3/28 plus anti-CD40 (1, green) 
or IFN-o (m, green) co-stimulation. n, Line and scatter plot showing extent of 
proliferation occurring (percentage of live cells on day 6 having undergone each 
of zero to four divisions) after polyclonal stimulation of primary human CD8 
T cells with CD3/28 alone (blue) or with additional anti-CD2 co-stimulation 
(red), confirming no difference in the extent of live cell proliferation between 
groups. 0, Absolute live (AquaFluorescent Dye) cell counts (y axis) by the 
number of divisions undertaken (x axis) by day 6 after polyclonal stimulation of 
primary human CD8 T cells with CD3/28 alone (blue) or with additional anti- 
CD2 co-stimulation (red), illustrating increased cell survival with CD2 
co-stimulation despite equivalent proliferation. P values = two-way ANOVA 
of four paired stimulations. 
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Extended Data Figure 6 | CD2 co-stimulation results in functionally 
distinct subpopulations showing enhanced survival after in vitro re- 
stimulation but no preferential expansion of CD8 memory subsets. 

a, Representative flow cytometry density plots of CD8 T cells showing BCL2 
expression on day 7 after stimulation with anti-CD3/28 (blue) or anti-CD2/3/ 
28 (red). Figures are the percentage of total CD8 T cells. b, Quantification of 
BCL2 expression in CD8 T cells stimulated as in a. P= Mann-Whitney, n = 5 
paired biological replicates per group. c, Scatter plots showing cytokine levels (y 
axis, picograms per millilitre) measured in supernatants of CD8 T cells on day 7 
after in vitro stimulation with either anti-CD3/28 (left column, blue) or CD2/3/ 
28 (right column, red). Samples represent paired stimulations of primary CD8 
T cells from the same individual using either stimulation protocol (n = 6 
biological replicates per group). d, Scatter plots illustrating populations sorted 
after polyclonal anti-CD3/28 (left panel) and anti-CD2/3/28 (right panel) 
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stimulation of primary CD8 T cells. e, Percentage of live cells (AquaFluorescent 
dye’) remaining 7 days after re-stimulation of each sorted subpopulation of 
CD8 cells. Cells were rested for 6 days in complete RPMI1640 medium without 
IL-2 before being re-stimulated with anti-CD2/3/28 for a further 7 days. 

P= Mann-Whitney; error bars, mean + s.e.m. f, Representative scatter plot 
illustrating CD8 T-cell memory populations isolated by flow cytometric sorting 
and stimulated in g, h. g, Scatter plot showing absolute number of IL-7R™ cells 
(y axis) on day 6 after anti-CD3/28 (blue) or anti-CD2/3/28 (red) stimulation of 
purified CD8 T-cell memory populations (x axis). *P < 0.05, Mann-Whitney 
test (n = 5 paired biological replicates per group). h, Scatter plots showing 
percentage CD8 T-cell memory subsets (y axis) resulting from stimulation of 
purified central memory (Tcm), naive (Tn), effector memory (Tem) and 
effector memory-RA (Temra) populations with anti-CD3/28 (blue) or anti- 
CD2/3/28 (red) for 6 days (n = 4 paired biological replicates per group). 
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Extended Data Figure 7 | Top PBMC surrogate markers reflect expression 
of CD4 co-stimulation/CD8 exhaustion modules within CD4 and CD8 data 
respectively. Top PBMC-level predictors (n = 13) were selected as indicated in 
Fig. 4a, and data are shown comparing expression of the optimal predictor 
(KAT2B, a, e) and of each other top predictor gene (d, h) in PBMC data 
compared with expression of the CD4 co-stimulation module eigengene in 
CD4 data (a-d) and the CD8 exhaustion signature eigengene in CD8 data 
(e-h) for n = 44 patients with AAV. Significance of correlation: *P < 0.05, 
**P < 0.01, ***P < 0.001. b, f, Scatter plots showing the outcome of multiple 
linear regression models testing the association of KAT2B expression in CD4 
(b) and CD8 (f) data (red symbols) directly compared with clinical markers of 
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disease activity (black symbols); x axis, magnitude of association (regression 
coefficient, change in normalized flare rate (flares per number of days follow- 
up) per unit change in each variable tested); y axis, significance of association in 
multiple regression model; P, significance threshold (dashed red line, P = 0.05). 
Clinical variables incorporated were disease activity score (BVAS), C-reactive 
protein, lymphocyte count, neutrophil count, IgG. c, g, Heat maps reproduced 
from Fig. 1a, i, respectively, showing overlap of top PBMC-level predictors with 
the modular analysis presented for CD4 (c) and CD8 (g) data in Fig. 1. As 
expected, surrogate markers showed stronger correlation with the CD4 than the 
CD8 signature as the algorithm was trained to detect the CD4 co-stimulation 
module. 
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Extended Data Figure 8 | Immune cell subset expression pattern of top stimulation/CD8 exhaustion signatures (from Fig. 4a) in a range of 22 immune 
PBMC-level surrogate markers of CD4 co-stimulation/CD8 exhaustion cell subsets. Genes showing significant correlation of expression with KAT2B 
signatures. Dot plots showing expression (median + s.e.m.) of KAT2B (a) and across all cell types are indicated (**P < 0.001). 

for each of 12 other top PBMC-level surrogate predictors of CD4 co- 
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Extended Data Figure 9 | Hierarchical clustering of multiple data sets using 
13 top PBMC-level surrogate markers of CD4 co-stimulation/CD8 
exhaustion modules identifies subgroups of patients with distinct clinical 
outcomes. Replication of association between surrogate markers of CD4 
co-stimulation/CD8 exhaustion signatures and clinical outcome (as shown in 
Fig. 4c-k) but using all top 13 PBMC-level surrogates rather than KAT2B alone. 
a, c, e, g, i, k, m, Heat maps showing hierarchical clustering of gene expression 
data of 13 top PBMC-level surrogate predictors of CD4 co-stimulation/CD8 
exhaustion signatures (from Fig. 4a) in patients with chronic HCV (a), during 
malaria vaccination (c), influenza vaccination (e), yellow fever vaccination 
(g), dengue fever infection (i), IPF (k) and pre-T1D (m). Subgroups were 


LETTER 


defined using a major division of the cluster dendrogram and group 1 allocated 
on the basis of KAT2B expression (highest in group 1). Clinical outcome 
associated with each subgroup identified is shown in b (HCV, percentage of 
responders to IFN-o/ribavirin therapy), d (percentage showing protection 
versus no protection from malaria vaccine), f (percentage response to influenza 
vaccination), h (yellow fever antibody-titre after vaccination), j (percentage 
progression to DHF), | (percentage of patients progressing to need for 
transplantation or death) and n (percentage of samples from patients with 
previous or subsequent progression to islet-cell antibody seroconversion or toa 
diagnosis of T1D). 
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Extended Data Figure 10 | Kinetics of KAT2B expression during treatment 
of chronic HCV, malaria and influenza vaccination, during T1D 
development in the NOD mouse and in PBMC data from patients with IBD 
and rheumatoid arthritis. a, Expression of a type 1 IFN response signature 
(average eigenvalue of type 1 IFN response signature plotted for each response 
group at each time point, A, signature as defined in ref. 4) in a cohort of 54 
patients during treatment of chronic HCV infection with pegylated IFN-o and 
ribavirin (as described in ref. 53 and Fig. 4c), including 28 showing a marked 
response (red line, HCV titre decrease >3.5 logy (IU ml” ty by day 28) and 26a 
poor response (HCV titre decrease <1.5 logi9(IU ml!) by day 28). P = two- 
way ANOVA. b, Schematic representation of the vaccination (black) and 
transcriptome profiling (red) schedule for the adjuvanted RTS,S malaria 
vaccine trial”* (as shown in Fig. 4d). b-d, Heat map (b) and line plots 

(c, d) illustrating temporal changes in expression of 404 genes representing the 
GO ‘inflammatory response’ module (c) or KAT2B expression (d) at each time- 
point during vaccination in patients with above- (red) and below- (blue) 
median KAT2B expression throughout the vaccination schedule outlined in 
b. Subgroups defined at T2, immediately after booster vaccination as this 
equates to the period of most ‘active’ immune response. Plots are mean + s.e.m. 
e, Schematic representation of the vaccination (black arrows) and 
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transcriptome profiling (red arrows) schedule for 28 vaccinees receiving the 
2008 seasonal influenza vaccination (combined trivalent inactivated influenza 
vaccine” as shown in Fig. 4e) with response assessed at day 28 by 
haemagglutination inhibition titre (green arrow). f, Linear plot illustrating 
temporal changes in expression of 404 genes representing the GO 
‘inflammatory response’ module at each time-point during vaccination (d0-d7 
corresponding to microarray bleed points in e for patients showing above- (red) 
or below- (blue) median expression of KAT2B at day 3 after vaccination; y, 
expression log,; x, time-point, days after vaccination; P = two-way ANOVA. 
g, Linear plot showing ratio of Kat2b expression in peripheral blood of NOD 
mice (y axis, n = 37 mice in total across six time points) before and during the 
induction and onset of insulitis and the development of overt diabetes 
(illustrated by black bars below); x axis, age (days); y axis, Kat2b expression log, 
ratio versus B10 controls”. h, Kaplan-Meier censored survival curve showing 
flare-free survival (y axis) during follow-up (x axis) of n = 58 patients with IBD 
stratified by KAT2B expression (red, above median; blue, below median). 

P= log-rank test. i, j, Box plots showing clinical response (percentage 
responders) 3 months after treatment with anti-TNF therapy in two 
independent cohorts (I** and J°°) of patients with rheumatoid arthritis (RA). 
P= Fisher’s exact test. Linear plots show mean + s.e.m. throughout. 
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Intracellular energy distribution has attracted much interest and 
has been proposed to occur in skeletal muscle via metabolite- 
facilitated diffusion’*; however, genetic evidence suggests that 
facilitated diffusion is not critical for normal function*’. We 
hypothesized that mitochondrial structure minimizes metabolite 
diffusion distances in skeletal muscle. Here we demonstrate a mito- 
chondrial reticulum providing a conductive pathway for energy 
distribution, in the form of the proton-motive force, throughout 
the mouse skeletal muscle cell. Within this reticulum, we find pro- 
teins associated with mitochondrial proton-motive force produc- 
tion preferentially in the cell periphery and proteins that use the 
proton-motive force for ATP production in the cell interior near 
contractile and transport ATPases. Furthermore, we show a rapid, 
coordinated depolarization of the membrane potential component 
of the proton-motive force throughout the cell in response to spa- 
tially controlled uncoupling of the cell interior. We propose that 
membrane potential conduction via the mitochondrial reticulum 
is the dominant pathway for skeletal muscle energy distribution. 

The mechanism by which the forms of potential energy generated 
by oxidative phosphorylation are distributed within skeletal muscle 
cells has been the subject of active research for many years. Two major 
facilitated diffusion mechanisms have been proposed. The creatine 
kinase shuttle system proposes that creatine phosphate and creatine 
are cytosolic facilitated diffusion partners with ATP and ADP’, while 
the second model proposes that the oxy-deoxy myoglobin shuttle aids 
diffusion of oxygen from capillaries to mitochondria’. However, in the 
absence of myoglobin’, creatine kinase’, or creatine’, mice survive 
with near-normal skeletal muscle performance and only modest adap- 
tations”’, suggesting that these facilitated diffusion pathways are not 
critical for normal muscle function. Thus, a re-examination of the 
pathways for distributing potential energy within skeletal muscle 
is warranted. 

Weselected high-resolution, three-dimensional (3D, 15-nm isotropic 
voxels) focused ion beam scanning electron microscopy (FIB-SEM)*? 
to better define the diffusion pathways in mitochondria-rich muscle 
fibres. Previous investigations of muscle 3D structure’ ** have yielded 
conflicting results, probably due to the size of the serial sections 
(60-90 nm), a lack of true 3D information’’, and/or differences 
between muscle types studied”’. 

The contrast in the FIB-SEM image stack (Supplementary Video 1, 
raw data available at http://labalaban.nhlbi.nih.gov/files/SuppDataset.tif, 
see also Supplementary Information) was sufficient to enable auto- 
mated segmentation of mitochondria, myofibrils, blood vessels, red 
blood cells and nuclei (Fig. 1a, b) and revealed four different mitochon- 
drial morphologies (Supplementary Video 1): paravascular mitochon- 
dria (PVM) (Fig. 1c), I-band mitochondria (IBM) (Fig. 1c-e), fibre 
parallel mitochondria (FPM, Fig. 1d, e), and cross fibre connection 
mitochondria (CFCM, Fig. 1d). Consistent with our previous in vivo 
microscopy studies”’”"*, the PVM pool, often associated with a nucleus, 
was evident in mitochondria-rich fibres when imaged by FIB-SEM. 


Tracing the path of the PVM often revealed multiple or single direct 
connections (that is, a continuous outer membrane) with IBM” 
(Fig. 2b). A PVM could be coupled to one or both IBM on opposing 
sides of the z-line (Supplementary Video 2). CFCM were less frequent 
and also seen as pairs of tubules proceeding transversely on both sides of 
a z-line. However, CFCM were oriented perpendicularly to IBM 


Figure 1 | Muscle mitochondria form highly connected networks. a, 3D 
surface rendering of 25.53 X 24.06 X 4.23 um FIB-SEM volume segmented to 
show spatial relationships between mitochondria (green) and other structures 
(nucleus (N), cyan; capillary (V), magenta; red blood cell, red; myofibrils, 
grey). b, Removing myofibrils highlights different morphologies within intra- 
fibrillar mitochondrial (IFM) network. c-e, Zooming in reveals projections 
from paravascular mitochondria (PVM) into I-band mitochondria (IBM) (c), 
and numerous interactions between IBM and cross-fibre connection 
mitochondria (CFCM) (d) and fibre parallel mitochondria (FPM) (d, e). 

Scale bars, 3 jum. Representative of eight separate volumes analysed 

from four animals. 


National Heart Lung and Blood Institute, National Institutes of Health, Bethesda, Maryland 20892, USA. ?National Cancer Institute, National Institutes of Health, Bethesda, Maryland 20892, USA. 
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Figure 2 | Muscle mitochondrial morphologies. a, 3D rendering of 
mitochondria from FIB-SEM volume. Non-white colours indicate individual 
mitochondria. Scale bar, 750 nm. b, PVM projecting into IBM. c, Mitochon- 
drion showing IBM and CFCM. d, Repeating connections between IBM and 
FPM. e, Myofibril (grey) and associated mitochondria (green). f, FRM with 
IBM and CFCM branches. Dotted lines indicate z-lines. g, Deconvolved super- 
resolution image of TMRM-loaded myocyte. LD, lipid droplet. Insets are 
magnified 3 from original. h, Mean mitochondrial diameters. Error bars 
indicate standard error. Quantification from raw images. Significantly different 
from PVM (*), IBM (+), FPM ({) (ANOVA, P< 0.05). Images represent 
data from: FIB-SEM, 8 fibres, 4 mice; STED, 13 fibres, 3 mice. 


(Fig. 2c, d). Sarcomeres were observed to have associations with mul- 
tiple mitochondrial structures (Fig. 2e). This analysis demonstrates 
that PVM are directly coupled to most IBM in the muscle, providing 
a conductive pathway for potential energy transfer from the periphery 
to deep inside the muscle. Skulachev proposed over 40 years ago that 
the electrical component of the proton-motive force, the membrane 
potential (AY), could be used by the cell as a transportable form of 
potential energy” and later demonstrated that mitochondria could be 
electrically coupled in cultured cells”. 

PVM were highly irregular (Supplementary Video 1), mostly ellips- 
oidal structures. Their maximal cross-sectional diameter ranged from 
300 to 1,600 nm (Extended Data Table 1). PVM that were contiguous 
with IBM (21.2 + 3.2% of 1,152 PVM analysed, 4 animals) were 
located primarily at the PVM-myofibril interface (Extended Data 
Fig. 1a). Although a small fraction of PVM was directly coupled to 
IBM, the entire PVM pool (99.9 + 0.1%) was linked via electron-dense 
contact sites (EDCS) between membranes of adjacent PVM (Extended 
Data Fig. 1b). These apparently dynamic EDCS””’ may represent sites 
of fission or fusion’* as well as simply reflect tight packing of the 
mitochondria; however, the functional significance of these junctions 
is poorly defined. If EDCS are electrically conductive, these sites would 
couple essentially all PVM directly to IBM running deep into the 
muscle, creating a functional syncytium with regard to the proton- 
motive force (AY + pH gradient across the inner mitochondrial 
membrane) to generate ATP. 

IBM were regular cylinders with a maximal cross-sectional diameter 
range of <100-678 nm (Extended Data Table 1) and, often, lengths 
>20 um (Fig. 2), suggesting that they extend deep into the tissue. FPM 
(Fig. 2d, f) displayed maximal cross-sectional diameters ranging from 
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362 to 1,495nm (Extended Data Table 1). 99.7+ 0.3% of FPM 
(n = 776, 4 FIB-SEM data sets, 4 animals) were associated with adja- 
cent FPM through EDCS (Extended Data Fig. 1d). Interconnection of 
different intra-fibrillar mitochondrial (IFM) morphologies was extens- 
ive. Indeed, 81.7 + 3.6% of FPM were directly connected (continuous 
outer membrane) to an IBM (Extended Data Fig. 1c, n = 305 mito- 
chondria from 4 FIB-SEM data sets from 4 animals). Mitochondria 
lacking either direct or EDCS connections were rare (~1-2 per data 
set) and may reflect either damaged mitochondria removed from the 
network" or newly formed mitochondria yet to establish a network 
connection’*”’. For a better appreciation of the mitochondrial reticu- 
lum throughout the entire cell, we collected 3D, multiphoton micro- 
scopy (MPM) images of mitochondria-rich fibres in situ. Although the 
spatial resolution was ~20-fold lower than with FIB-SEM, these MPM 
images (Extended Data Fig. 2 and Supplementary Videos 3, 4 and 5) 
suggest that the morphology observed by FIB-SEM at the periphery 
of the fibre persists throughout the entire cell. Additionally, super- 
resolution microscopy of mitochondria using AY-dependent tetra- 
methylrhodamine (TMRM) in live, isolated muscle fibres provided 
further validation that the mitochondrial morphologies observed with 
FIB-SEM are representative of the in vivo condition (Fig. 2g, h, 
Extended Data Fig. 3 and Supplementary Video 6). 

On the basis of the mitochondrial structures, we hypothesized that 
PVM are primarily involved in generation of the proton-motive force 
near the capillaries, while IBM use the proton-motive force to produce 
ATP near the ATPase activity. Along this line of thought, the distri- 
bution of the enzymes associated with production and utilization of 
the proton-motive force may be linked to the function of PVM and 
IBM pools, thereby optimizing the regional inner membrane concen- 
tration of enzymes to its functional requirements. Again, the PVM 
generation of A¥ is delivered to the IBM system via the mitochondria 
reticulum. To test this hypothesis, we compared the cellular distri- 
bution ofa proton-motive-force-generating element, complex IV, with 
the enzyme complex responsible for phosphorylation of ADP to ATP 
using the proton-motive force, complex V. Figure 3a—c shows that 
complex IV (green) was higher in the periphery of the cell, consistent 
with the PVM pool, while complex V (red) was more concentrated in 
the intra-fibrillar region (Fig. 3 and Supplementary Video 7). Notably, 
complex IV and complex V were found throughout the cell 
(Supplementary Video 8 and Extended Data Fig. 4), just at different 
proportions in the different mitochondrial pools. Antibody specificity 
was validated by the appearance of single bands corresponding to 
complexes IV and V in western blots of mitochondrial proteins sepa- 
rated by clear native PAGE (Extended Data Fig. 5). 

To test our hypothesis that the mitochondrial reticulum is coupled 
via a contiguous matrix and/or EDCS, we performed localized mito- 
chondrial uncoupling experiments and observed whether the assoc- 
iated drop in AY propagated throughout the mitochondrial pools in 
isolated muscle cells. In this study, we simulated an increase in intra- 
fibrillar complex V activity with a photo-activated uncoupler* and 
observed the cellular topology of AY via TMRM distribution. Upon 
a mild depolarization of a small interior cell region, a rapid (<200 ms) 
and near-uniform drop in AY occurred across the cell, including a 
homogenous decrease in the PVM, seen as a redistribution of TMRM 
from mitochondria to cytosol and nuclei (Fig. 4 and Supplementary 
Video 9). Note that capillary grooves in the sarcolemma and adjacent 
PVM are still present even after vessel removal by the fibre isolation 
process (Extended Data Fig. 6). Figure 4k shows the mean post/pre 
change in TMRM intensity for mitochondrial and cytosolic plus nuc- 
lear pixels in each cell region. This analysis resulted in isolated signals 
from the cytosol, nucleus and PVM pools. However, all of the intra- 
fibrillar mitochondria were grouped together in this analysis. While 
linear intensity analysis (Extended Data Fig. 7a—d) could be used to 
assess IBM in some cases, any inclusion of FPM in the analysis window 
would be confounding, thereby limiting the image regions available for 
analysis. We were able to isolate the entire IBM pool based on their 
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Figure 3 | Capacity for membrane potential conduction. a, 3D rendering of 
muscle fibre immunostained for both complex IV and complex V. Confocal 
image coloured according to complex IV/complex V ratio. Relatively higher 
complex IV, green pixels; relatively higher complex V, red pixels. Nuclei, blue. 
b, c, Separation of PVM around fibre periphery (b) and IFM (c) within fibre as 
used for calculations. d, PVM have relatively greater capacity for membrane 
potential generation while IFM have greater capacity for membrane potential 
utilization. Images are representative of data from 12 fibres, 5 mice. Error bars 
indicate standard error. Asterisk indicates significantly different from PVM 
(paired t-test, P< 0.05). Scale bars, 15 um. 
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Figure 4 | Mitochondrial membrane potential conduction. Regional 
uncoupling of fibre loaded with TMRM and MitoPhotoDNP. White dotted 
lines indicate region of MitoPhotoDNP UV activation. a-c, Pre-UV 
whole-cell (a), mitochondria (b) and cytosol plus nuclei (c) images. 

d-f, Post-UV whole-cell (d), mitochondria (e) and cytosol plus nuclei 

(f) images. g-i, Post/pre-UV ratios for whole-cell (g), mitochondria (h) and 
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regular ~2,m spacing (or 0.541m‘' spatial frequency) using a 
horizontal fast Fourier transform (FFT). A decrease (61 + 3% 
(n=9)) in the FFT-detected IBM pool occurred with a more even 
TMRM distribution between the IBM and cytosol (Extended Data 
Fig. 7e, f). Conversely, ultraviolet (UV) exposure in control cells 
resulted in a small, uniform drop in TMRM in both the cytosol and 
mitochondria in the stimulated region, consistent with slight TMRM 
photobleaching, and little effect in other regions of the cell, including 
the IBM pool (Extended Data Figs 7f and 8 and Supplementary Video 
10). Restriction of UV light to designated regions of interest (ROIs) 
was confirmed in fluorescent test slides (Extended Data Fig. 9). 

These data are consistent with a tight, rapid electrical coupling 
mechanism between mitochondria, and that the PVM pool is a 
functional syncytium with regard to AY. The rapid, homogeneous 
response to the modest uncoupling, as shown in Fig. 4 and 
Supplementary Video 9, is indicative of a highly coupled, conductive 
network, although the mechanisms for this are yet to be fully resolved. 
We suggest, based on our structural data, that the contiguous matrix 
elements between mitochondrial pools are one of the obvious conduct- 
ive elements of the network. We speculate that the EDCS are also 
conductive elements specifically within the PVM and FPM pools. 
Mitochondrial fission and fusion’® may also have a role in reticulum 
development and modulation. It is important to note that the putative 
conduction through the EDCS could also be dynamically modulated to 
affect mitochondrial electrical coupling. 

Energy requirements in skeletal muscle can nearly instantaneously 
increase by more than 100-fold during intense contraction’’. Thus, 
using electrical conduction as opposed to oxygen or ATP diffusion is 
a more effective way to quickly and uniformly distribute energy 
throughout these cells and may explain why disruption of the known 
ATP and oxygen facilitated diffusion systems results in only modest 
phenotype changes*®”. If electrical conduction via the mitochondrial 
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cytosol plus nuclei (i). j, Cell regions. k, Near-uniform redistribution of TMRM 
from mitochondria to cytosol plus nuclei consistent with homogenous 
depolarization of all cell regions. Mean + standard error from 11 experiments, 
4 animals. Significantly different (ANOVA, P< 0.05) values are indicated for 
post/pre ratio of 1.0 (*), control (+) (Extended Data Fig. 8). No statistical 
differences between cell regions. 
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reticulum is the major mechanism for potential energy distribution 
within the cell, this raises the question of why such a large investment 
in creatine kinase, creatine and myoglobin has been retained by the 
muscle. In the knockouts of these systems, small decrements, ~ 10%, in 
peak performance have been noted. Although these are often dis- 
counted as small effects, a 10% advantage in peak muscle performance 
would probably provide a significant evolutionary advantage, justify- 
ing the retention or development of these two complementary systems. 
Asa reference, athletic performance is often measured in fractions of a 
per cent; evolutionary advantages might have a similar sensitivity. We 
propose that this conductive pathway is the major mechanism for 
energy distribution in skeletal muscle cells under normal conditions, 
and the metabolite facilitated diffusion pathways only become signifi- 
cant as one approaches maximum performance levels. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 

Mice. Male C57BL/6 mice, 2-4 months old, were purchased from Taconic Farms 
(Germantown, NY). All mice were fed ad libitum and kept on a 12h light, 12h 
dark cycle at 20-26 °C. The experiments were not randomized; the investigators 
were not blinded to allocation during experiments and outcome assessment. 
Animal preparation. All procedures were approved by the National Heart, Lung, 
and Blood Institute Animal Care and Use Committee and performed in accord- 
ance with the guidelines described in the Animal Care and Welfare Act (7 USC 
2144 ). Mice were anaesthetized and the tibialis anterior (TA) muscle prepared by 
removing the skin and outer layers of fascia as described previously~*. 

Focused ion beam scanning electron microscopy (FIB-SEM). Mouse hindlimbs 
were stabilized in the lengthened position and submerged in vivo in 5% glutar- 
aldehyde in 0.12 M sodium cacodylate, pH 7.35 (fixative) for 15 min. After initial 
fixation, the TA was excised, placed into fresh, cold fixative, and cut into small 
strips. Fixed muscles were post-fixed with 1% osmium tetroxide, stained with 1% 
uranyl acetate, dehydrated with an ethanol series and propylene oxide, and embed- 
ded in EMbed-812 (Electron Microscopy Sciences, Hatfield, PA). 

Nine data sets of mouse tibialis anterior muscle from four animals were col- 
lected on mitochondria-rich skeletal muscle fibres. One data set failed due to 
technical difficulties during data collection. A Zeiss Nvision 40 microscope was 
used for FIB-SEM imaging. The areas of muscle were chosen for FIB-SEM data 
collection following survey of 0.5-1-j1m-thick sections of resin-embedded muscle 
tissue; sections were created using an Ultracut S microtome from Leica Micro- 
systems. The sections were stained with Azure II/Toluidine blue that imparts 
colour to common morphological elements such as nuclei, plasma membranes 
and mitochondria. Once stained, the orientation and morphology were assessed 
using a light microscope. Muscle fibres were selected based on the presence of large 
accumulations of mitochondria in the paravascular region adjacent to a capillary 
embedded in the muscle cell’. Muscle fibres in the correct orientation with good 
morphology were chosen for FIB-SEM data collection and digital images were 
taken at 10X, 40 and 60X magnification. These images were used as maps to 
pinpoint the previously selected areas for data collection in the FIB-SEM. 

SEM images were collected with 1.5kV landing energy using the ESB (back- 

scattered electron) detector. Parameters were chosen to provide 8-bit images with 
pixel sizes x = 5 nm, pixel size y = 5 nm, and z thickness = 15 nm. The FIB aper- 
ture sizes used for data collection were 300pA and 700pA. Automated data 
collection was performed using Atlas 3D software, which is integrated into the 
Zeiss platform, manufactured by Fibics”’. 
Focused ion beam scanning electron microscopy image processing and 
analysis. Images (tifs) of each data set were aligned to each other using a cross- 
correlation algorithm”®, then binned 3 X 3 in x and y directions to create isotropic 
volumes with edge dimensions of 15 nm. These volumes were initially analysed 
using IMOD open source software package’. Whole image segmentation was 
performed by first applying a 3D, two pixel median filter on a FIB-SEM data set 
and then manually adjusting the threshold to create binary images primarily 
including only mitochondria, myofibrils, nuclei, blood vessels, red blood cells, 
and/or z-lines. Thresholding often resulted in inclusion of two or more types of 
cellular structures. Individual structures were then segmented by subtracting 
images that had been differentially thresholded and/or using the Remove 
Outliers tool in ImageJ (National Institutes of Health, Bethesda, MD). 

Using the TRAKEM2 plugin in Image], the mitochondrial volumes were manu- 
ally traced throughout the 3D volumes by two different observers. Most of the IBM 
structures were limited by the field of view of the 3D volumes, thus, the long axis 
extent of these structures through the cell were potentially grossly underestimated. 
For studies designed to estimate the number of PVM or FPM that have direct 
coupling to the IBM (that is, a continuous outer membrane), tracking was initiated 
at the middle of the 3D stack to minimize the effect of the field of view on detecting 
complete mitochondrial volumes. All PVM or FPM were then tracked through the 
3D volume. Those mitochondria with a continuous projection into an IBM were 
labelled with a green spot. Those without a projection to an IBM were labelled with 
a red spot. PVM or FPM that continued out of the field of view without an IBM 
projection were labelled with a blue spot. 

To determine the potential network of coupling between the mitochondria via 
EDCS, the same mid-volume planes were used. Initiating in the centre of a PVM 
pool or in the intra-fibrillar space for FPM analysis, every mitochondrion that was 
‘connected’ via an EDCS was marked with a magenta marker. The PVM with IBM 
projections were also labelled with a green marker. Thus, PVM that project into 
IBM and also have EDCS were labelled with both green and magenta dots. The 
EDCS coupled most PVM and most FPM even without looking through the entire 
3D volume. However, when using the full 3D volume, nearly all of the mitochon- 
dria were coupled via the EDCS structures. Mitochondria not coupled via an 
EDCS were labelled in orange in these studies; uncoupled mitochondria were 
rarely observed. 
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Transmission electron microscopy. Selected specimens for FIB-SEM imaging 
were sectioned to produce ~100-150-nm-sized thin sections that were stained 
with uranyl acetate and lead citrate before imaging on a JEM 1400 electron micro- 
scope (JEOL USA, Peabody, MA) with an AMT XR-111 digital camera (Advanced 
Microscopy Techniques Corporation, Woburn, MA). 

Multi-photon image acquisition. Mouse TA muscles were imaged in situ after 
cutting the hindlimb just above the knee, leaving the lower hindlimb and the origin 
and insertion of the TA intact. The lower hindlimb was placed in a deformable cast 
ina Petri dish with the TA in the lengthened position and immersed in phosphate- 
buffered saline. Images were acquired using a Leica TCS SP5 II upright, resonant 
scanning, multi-photon microscope with a Nikon 25X, 1.1 NA water immersion 
objective. The exposed muscle was imaged with a pulsed Ti:sapphire laser tuned to 
720 nm (Spectra Physics, Irvine, CA, USA), and all emitted light was collected by 
two hybrid detectors (HyDs, Leica Microsystems) separated by a 545 nm dichroic 
mirror. 3D image stacks of endogenous NAD(P)H were collected as 1,024 X 1,024 
pixel, 12-bit images with 100 nm x and y pixel sizes and 100 nm z-steps between 
images. Image stacks were deconvolved using the measured point spread function 
and the 3D Parallel Iterative Deconvolution plugin in ImageJ for presentation 
purposes only. 

Dual muscle section immunostaining. Mouse TA muscles were frozen in the 
lengthened position and embedded in optimal cutting temperature compound 
(OCT). Cryosections were cut and air dried for 5 min before fixation in 10% 
buffered formalin for 7 min at room temperature for confocal imaging. Samples 
were washed three times in PBS for 5 min each and permeabilized with 0.01% 
Triton-X 100 in PBS for 5 min. Sections were blocked with 10% goat serum for 
20 min before incubation with primary antibodies. Distribution of Complex V was 
assessed using anti-ATPB rabbit polyclonal IgG antibody from Abcam (ab128743; 
Cambridge, MA) and complex IV distribution was assessed with anti-complex IV 
subunit I mouse monoclonal IgG2a from Life Technologies (1D6E1A8; 
Grand Island, NY). Primary antibodies were incubated overnight at 4 °C at 1:75 
(complex V) and 1:40 (complex IV) dilutions. After three 5 min washes in PBS, 
samples were incubated with secondary antibodies for 1h at room temperature. 
Secondary antibodies used were TRITC goat, anti-rabbit IgG (1:100) and FITC 
goat, anti-mouse IgG (1:100) from Jackson ImmunoResearch Laboratories (West 
Grove, PA) for confocal imaging. To reduce non-specific binding of the anti- 
mouse secondary antibody, the MOM Fluorescein Kit from Vector Laboratories 
(Burlingame, CA) was used. Samples were then washed with PBS and mounted 
with VectaShield and DAPI (Vector) for confocal imaging. Slides were prepared 
for sections with each primary antibody separately, both antibodies together, and 
with primary antibodies omitted (negative control) from 6 TA muscles from 5 
mice. No statistical methods were used to predetermine sample size—with the 
limited information available, we selected the sample size based on prior experi- 
ence with immunostaining procedures in muscle. 

Immunostained muscle image acquisition. For confocal imaging, stacks of high 
resolution, 12 bit images (100 nm x and y pixel size) were collected sequentially 
throughout the depth of skeletal muscle tissue sections at 0.2 jum depth intervals 
using a 63X 1.4 NA oil immersion objective on a Leica SP5 confocal system (Leica 
Microsystems, Mannheim, Germany), or with a63X 1.4 NA oil immersion object- 
ive on a Zeiss LSM780 confocal microscope (Carl Zeiss Microlmaging, Jena, 
Germany). FITC-labelled samples were imaged with a 488-nm excitation laser 
and a 505-550 nm acquisition window and TRITC-labelled samples were imaged 
with a 561-nm excitation laser and a 570-620 nm acquisition window. DAPI- 
labelled samples were imaged with a 405 nm excitation laser and a 430-490 nm 
acquisition window. 

Immunostained muscle image analysis. Spatial, ratiometric analysis of the dis- 
tribution of complex IV and V within the muscle was performed. First, binary 
image masks were created by thresholding the 3D image stacks for both the 
complex IV and V signals, thereby allowing the removal of all pixels with negligible 
signal. Thus, ratiometric analysis proceeded only for pixels where signal could be 
detected. Thresholding was performed in ImageJ using Li's Minimum Cross 
Entropy thresholding method. Ratio images were created by dividing the complex 
IV signal by the complex V signal for all remaining pixels. The look-up table for the 
ratiometric images was set as follows: the mean of the middle 80% of the data and 
removed pixels were set to black; green intensity was linearly increased from 0 at 
the mean up to 255 at 2X mean and above; red intensity was linearly increased 
from 0 at the mean up to 255 at 0.5 mean and below. The look-up table is 
included in Fig. 4a for reference. The spatial distribution of green (relatively higher 
complex IV) and red (relatively higher complex V) pixels was assessed for the 
PVM and IFM pools by tracing the outer boundary of the fibre and tracing the 
inner boundary of the PVM pool, allowing separate analysis of each region. The 
total intensity of green pixels was divided by the total intensity of red pixels for 
each region. This analysis was performed on 12 total fibres from 5 mice: 6 fibres in 
cross-section and 6 in longitudinal section. 
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Additionally, the fluorescent intensity profiles of the complex IV and V signals 
were assessed individually with respect to their distance from the fibre boundary. 
The background fluorescence for each channel was determined as the mean of a 
manually drawn ROI outside the fibre of interest where no other fibre was present. 
Background was subtracted from all pixels in each image. A binary mask of the 
fibre was created by manually tracing the fibre of interest, and a distance map from 
the fibre boundary was created using ImageJ. This distance map allowed for the 
selection of all pixels in the complex IV and V images that were a given distance 
from the fibre boundary and the mean value for each distance calculated. This was 
performed for both the complex IV and complex V signals individually. For a 
given image, all values for the complex IV intensity profile were divided by the 
maximal complex IV signal measured near the fibre boundary and all values for 
the complex V intensity profile were divided by the maximal complex V signal 
measured near the fibre boundary. This analysis was performed on images of 
4 fibres from 4 mice. 

Clear native PAGE and western blotting. Mouse skeletal muscle mitochondria 
were used to test the specificity of the primary antibodies used in the whole muscle 
immunostaining studies. Skeletal muscles from the hindlimbs of two mice 
(~2.5 g) were dissected and combined for mitochondrial isolation as described 
previously” except omitting the Percoll gradient and subsequent washes. 
Cytochrome a,a3; (cyt a) content was determined as described previously”. 
Clear Native (CN) PAGE was performed using the NativeP AGE Novex Bis-Tris 
System (Invitrogen, Carlsbad, CA) with 4-16% 1 mm bis-tris gels and the anode 
and cathode buffers for high resolution CN PAGE 2 (hrCNE2)**. Mitochondria 
(10 pmol cyt a) were solubilized with 3% w/v dodecyl maltoside and added to each 
lane and runat 4 °C for 1 hat 150 V and 1.3 hat 250 V. Proteins were transferred to 
a PVDF membrane using a XCELL II blot module (Invitrogen, Carlsbad, CA) at 
25V for 3h at 4°C. After transfer, the PVDF membrane was incubated in 8% 
acetic acid for 15 min, rinsed with water, placed in Pierce StartingBlock buffer 
(ThermoScientific, Rockford, IL) for 1h at room temperature, and incubated in 
the primary antibodies used above (1:1,000 in StartingBlock) overnight at 4 °C. 
The membrane was then washed for 20min in StartingBlock buffer and two 
more times for 20min in PBS + 0.05% Tween 20. Incubation with secondary 
antibodies (Alexa 488 donkey anti-mouse IgG and Alexa 488 goat, anti-rabbit 
IgG, Abcam 150105 and 150077, respectively) was done at a 1:10,000 dilution 
in PBS + Tween 20 for 1.5h at room temperature in the dark. Membranes were 
washed 2X in PBS + Tween 20 for 20 min in the dark and imaged on a Typhoon 
variable mode imager. 

Single muscle fibre isolation and imaging. For these studies, mouse soleus 
muscle fibres were used to increase the probability of acquiring mitochondria-rich 
muscle fibres. The basic incubation medium (IM) was composed of (in mM): 
NaHEPES (10), NaCl (137), KCl (5.4), CaCl (1.8), MgCl, (0.5), NaPO, (0.5), 
glucose (10), NaPyruvate (1) and butanedione monoxomine (BDM, 20). The 
soleus muscles were quickly isolated tendon to tendon. The intact muscle was then 
placed in the IM solution containing 3 mg ml‘ collagenase D (Roche) for 30 to 
45 min in a shaking water bath at 37 °C. The tissue was removed from the collage- 
nase solution and placed in the IM solution alone. The fibres were gently teased 
from the muscle bundle and long, non-contracted fibres were used for study. For 
TMRM labelling, the fibres were incubated in IM containing 5nM TMRM for 
>20 min. For regional uncoupling of mitochondria, MitoPhotoDNP”, a gift from 
S. Caldwell and R. Hartley (University of Glasgow), was used. In this preparation, 
we found that we needed an incubation concentration of 20 [1M for approximately 
30 min at room temperature to achieve adequate loading of the mitochondria. The 
loaded cells with TMRM and MitoPhotoDNP still in the IM were transferred to a 
Petri dish with a no. 1.5 coverglass bottom (MatTek) treated with Cell-Tak (BD 
Bioscience) for microscopy. 

Confocal microscopy was conducted on an inverted ZEISS 780 with a 40X, 1.2 
NA water objective. 976 X 976 pixel, 12-bit images of 106.3 X 106.3 1m were 
collected while TMRM was excited with a 561 nm laser at 0.2% power. Emitted 
light was collected from 570 nm to 695 nm. To photoactivate MitoPhotoDNP, an 
ROI was drawn along the longitudinal axis within the centre of the cell and 
exposed to UV light from a 355nm laser (Coherent Inc.) at 40% power for 10 
iterations. It is important to note that the mild depolarization used here is of 
similar magnitude to the full change in membrane potential of mitochondria 
moving from resting to maximally active states (so called State 4 to State 3 trans- 
ition) where the ~200 mV membrane potential decreases by only 10-20%. To be 
physiologically relevant, any conductive mechanisms must be responsive to small 
changes in membrane potential. In control experiments, fibres which were not 
loaded with MitoPhotoDNP were exposed to an identical dose of UV light in an 
ROI in the centre of the cell. Experiments were only performed on fibres with a 
strong and homogeneous TMRM signal as low and heterogeneous signals are 
potential signs of a damage caused by the isolation process. 


Quantitative image analysis was performed using ImageJ on an average of five 
images taken before (Pre) and of five images taken immediately after UV expo- 
sure/MitoPhotoDNP release (Post). Image acquisition time varied slightly 
depending on cell width but averaged ~200 ms per frame. Both Pre and Post 
images were amplitude thresholded to separate the mitochondrial and cytoso- 
lic+nuclear TMRM signals. Cells were then analysed by cell region: paravascular, 
intra-fibrillar and irradiated. The irradiated region is the area in the centre of the 
cell exposed to UV light, and the boundary between the paravascular and intra- 
fibrillar regions is clearly distinguishable due to the regularity of the low cytosolic 
TMRM signal in the intra-fibrillar region where myofibrils are located. The 
mean TMRM signal was then measured for all mitochondrial pixels and all 
cytosolic+ nuclear pixels in each cell region in both the Pre and Post images. 
From these values, a Post/Pre ratio was calculated for the mitochondria and 
cytosol+ nuclei in each cell region. The ratio between the mitochondrial and 
cytosolic+nuclear TMRM intensities (mito/cyto ratio) was also calculated for 
each cell region in the Pre and Post images as this is a more valid measure of 
mitochondrial membrane potential than just the mitochondrial signal alone. 
Live cell mitochondrial morphology. Super-resolution microscopy was also 
performed on the live, isolated fibres loaded with TMRM using stimulated emis- 
sion depletion (STED) methodology. Although the spatial resolution of confocal 
microscopy is theoretically sufficient to measure mitochondrial cross-sectional 
diameters (mean values >300 nm) in these cells, the highly dense nature of the 
PVM pool makes it difficult to resolve the spaces between mitochondria. Thus, the 
improved spatial resolution with STED provides better definition of individual 
PVM structures. Gated STED images were obtained using a commercial Leica SP8 
STED 3X system (Leica Microsystems, Mannheim, Germany), equipped with a 
white light excitation laser. A 100X/1.4 NA oil immersion objective lens (HCX PL 
APO STED white, Leica Microsystems) was used for imaging. Using TMRM as the 
mitochondrial probe for STED allowed the measurement of mitochondrial struc- 
ture under the same conditions used to assess mitochondrial membrane potential 
conduction. Further, TMRM is highly recommended by Leica for live cell STED as 
it is very bright and highly photostable, which is supported by the results of the UV 
irradiation experiments described above. Also, tetramethylrhodamine has been 
used previously to collect STED images in live cells**. Image stacks were collected 
as 8 bit, 1,024 X 1,024 images with 30 nm x and y pixel sizes, 6 line averages and 
0.2 jum z-steps. Muscle cells were imaged with 550 nm excitation light, a 660 nm 
STED depletion laser at 20% power and an emission window from 564nm to 
640nm. Tandem confocal images (no STED depletion, 3X lower excitation 
power) were also obtained with confocal and STED images collected sequentially, 
frame by frame, with the confocal image first. 

Measurement of mitochondrial diameters was performed in ImageJ using the 
raw STED image stacks. A line was drawn across the maximal cross-sectional 
diameter and the length determined as the full width at half maximum 
(FWHM) of the linear fluorescent intensity profile. We analysed 175 mitochon- 
dria from 13 muscle fibres from 3 animals (see Extended Data Table 1). STED 
image stacks were deconvolved using Huygens software and the CMLE algorithm 
for presentation purposes only. 

Fast Fourier transform (FFT) IBM analysis. In the isolated muscle fibres, IBM 
were generally spaced every ~2 Um adjacent to the z-lines in the isolated cells. 
Since depolarization dissipates the TMRM signal from the mitochondria to the 
cytosol, we reasoned that an FFT of the TMRM signal along the long axis of the cell 
would selectively highlight, due to its periodicity, the IBM pool signal alterations. 
To accomplish this, a large ROI of the intra-fibrillar region was selected, eliminat- 
ing the PVM pool. The FFT were conducted serially along the long axis of the cell 
summing the power spectrum of each FFT to provide the overall power spectrum 
of the ROI. The IBM represented a unique resonance in the 0.5 1m~’ region 
permitting the specific analysis of the IBM amplitude using this approach. 
Statistical analyses. Differences in mitochondrial diameter for the four mitochon- 
drial pools were assessed using a one-way ANOVA with a Tukey’s HSD post hoc 
test anda P value of 0.05. Differences in the relative distribution of complex IV and 
complex V between the PVM and IFM pools were determined using paired t-tests 
and a P value of 0.05. Differences between isolated muscle fibres before and after 
UV exposure and for different cell regions were assessed by ANOVA with a 
Tukey’s HSD post hoc test and a P value of 0.05. 
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Extended Data Figure 1 | Mitochondrial coupling assessment. a, b, Single 
image from the middle of a FIB-SEM muscle volume including two nuclei (N) 
and a blood vessel (V). a, Direct coupling. PVM tagged green are those where a 
single mitochondrion projects from a PVM into an IBM with a continuous 
outer membrane (see inset) within the imaged volume. Red-tagged PVM did 
not project into IBM. Blue-tagged PVM continued out of the field of view 
and could not be classified. b, Electron-dense contact site (EDCS) coupling. 
Magenta-tagged mitochondria were connected by EDCS (see yellow arrows in 
inset). Additional green dots were added to PVM which project into IBM 
(see a). Greater than 99% of tracked PVM were coupled by EDCS. ¢, d, Single 
image from the middle of the intra-fibrillar region of a FIB-SEM volume. 
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c, Direct coupling. FPM tagged green are those where a single mitochondrion 
projects from an FPM into an IBM with a continuous outer membrane 

(see inset). Yellow-tagged FPM did not project into IBM. d, EDCS coupling. 
FPM tagged blue were connected to an adjacent FPM through an EDCS 

(see yellow arrows in inset). Images are representative of 4 samples with 
significant paravascular and intra-fibrillar mitochondrial content from 4 
animals. e, A longitudinal TEM mouse tibialis anterior muscle image showing 
the close association between adjacent mitochondria. f, Close up view of an 
EDCS highlighted in e showing the convergence of mitochondrial membranes 
between two adjacent mitochondria. Scale bars, 750 nm. 
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Extended Data Figure 2 | Multi-photon microscopy (MPM) images of fresh 
muscle fibres in situ. a, Endogenous mitochondrial NAD(P)H signal from 
a single XY image within a 3D volume of a muscle fibre. In this orientation, 
paravascular mitochondria (PVM) are apparent as clusters around the 
embedded capillary (V) and nuclei (N). Fibre parallel mitochondria (FPM) are 
seen as horizontal lines while I-band mitochondria (IBM) are seen as discrete 
spots. There are very few vertical lines in this image due to the infrequency 
of cross-fibre connection mitochondria (CFCM). The full volume can be seen 
in Supplementary Video 3. b, A YZ image from the same fibre volume as 

in a. PVM appear lateral to the embedded capillary (V), while IBM appear as 
vertical lines and FPM are seen as discrete spots. Note the lack of horizontal 
lines (CFCM) in this image or the accompanying video (Supplementary 
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Video 4). c, An XZ image of the same muscle volume as in a and b. PVM are 
located on the cell periphery; IBM are seen as vertical lines; FPM are seen as 
horizontal lines; and CFCM as discrete spots. In the inset, pairs of IBM 
projecting out from PVM are highlighted. d, A 3D rendering of the endogenous 
mitochondrial NAD(P)H signal within the muscle fibre shown in a-c. 360° 
views of this 3D rendering are shown in Supplementary Video 5. e, A view 
of the interior of the 3D rendering from d showing the regularity of the 
mitochondrial network within skeletal muscle. The field of view for the muscle 
volume in this image is 102.4 X 51.2 X 36.8 jm in x, y and z, respectively. 
Scale bars, 15 zm. Mean fibre volume from these MPM images was 

159,431 + 15,507 uum?, Images are representative of 4 fibres from 3 mice. 
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Extended Data Figure 3 | Super resolution microscopy allows improved d, image volume was 24.30 X 27.47 X 1.75 uum in x, y and z, and images shown 
visualization of individual mitochondria. a, Single confocal microscopy are the average of 9 images taken with 219 nm z-steps. Confocal images 
image of an isolated muscle fibre loaded with mitochondrial membrane were acquired before STED images at each image depth. All images were 
potential dye, TMRM. PVM, paravascular mitochondria; IFM, intra-fibrillar acquired with x and y pixel sizes of 30 nm. Scale bars, 2 um. As most muscle 
mitochondria; V, vessel. b, Single stimulated emission depletion (STED) mitochondria are larger than 300 nm, the improved resolution with STED is 
microscopy image of the same fibre as in a. c, The average of a confocal primarily noted by the increased clarity of the spaces between individual 
microscopy image stack of a TMRM loaded isolated muscle fibre. d, The mitochondria, particularly the PVM. Images are representative of 13 fibres 


average of a STED microscopy image stack of the same fibre as inc. Forcand from 3 mice. 
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Extended Data Figure 4 | Complex V expression is relatively higher than 
complex IV in the muscle fibre interior. a, Representative confocal 
microscopy image of a cross-section of a muscle fibre immunostained for 
complex IV. b, The same muscle section depicted in a immunostained for 
complex V. Scale bars, 15 jim. All images analysis was performed on the raw 
images. Brightness and contrast have been increased for the images shown 
here to improve presentation. c, Distance map from the boundary of the cell 
(yellow line) towards the interior of the fibre where image intensity corresponds 
to distance from the cell boundary. The mean fluorescence signal for all 
pixels of a given distance from the cell boundary was assessed for both 
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complex IV and complex V. Images representative of 4 fibres from 4 mice. 

d, Results from intensity profile analysis of 4 muscle fibres from 4 animals show 
relatively increased complex V in the fibre interior or, conversely, relatively 
higher complex IV near the fibre periphery. The red, solid line shows the mean 
intensity profile for complex V from the outside to the inside of the fibre. The 
green, dotted line shows the mean intensity profile for complex IV from the 
outside to the inside of the fibre. Values are normalized to the maximal intensity 
from each intensity profile to allow comparison of images of varying 
intensities. Grey shading around each line shows the standard error. 
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Extended Data Figure 5 | Primary antibody specificity. a, Coomassie- 
stained CN-PAGE gel after transferring isolated mouse skeletal muscle 
mitochondrial proteins to PVDF membrane for western blotting. 
Mitochondrial oxidative phosphorylation complexes I-V can be resolved as 
individual bands. b, Western blot dual immunostained for complex V, 
B-subunit (upper band), and complex IV, subunit I (lower band), with the 
same primary antibodies as used for the muscle section dual immunostaining 
as shown in Fig. 3 and Extended Data Fig. 4. 
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Extended Data Figure 6 | Sarcolemmal capillary grooves remain even after apparent as negative signals. b, c, Rotation of the volume in a around the 
removal of capillaries. a, Longitudinal view of a 3D rendering of the outer longitudinal (b) and cross-sectional (c) axes reveals the grooves in the 

10 jm of an enzymatically isolated muscle fibre imaged as the mitochondrial _ sarcolemma where capillaries were once present. Scale bars, 10 um. Images 
TMRM signal by confocal microscopy. Blood vessels (V) and nuclei (N) are representative of 11 fibres from 4 mice. 
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Extended Data Figure 7 | Additional analyses of mitochondrial membrane 
potential conduction. a, Confocal image of an isolated muscle fibre loaded 
with TMRM showing regions chosen for representative line profile analysis. 
Images are representative of 11 fibres from 4 mice. b, Line profile of TMRM 
fluorescence in the PVM~+nuclear (N) region of the cell (marked by yellow 
dotted lines in a) before and after activation of MitoPhotoDNP shows a 
decrease in PVM signal, increase in N signal, and no change outside of the cell 
where a blood vessel (V) was located before enzymatic fibre isolation. c, Line 
profile of TMRM fluorescence in the region of the cell irradiated with UV 
light (irradiated, marked by white dotted lines in a) showing a decrease in 
mitochondrial signal (peaks) and an increase in cytosolic signal (troughs) 
after MitoPhotoDNP activation. d, Line profile of TMRM fluorescence in the 
non-irradiated IFM region of the cell (intra-fibrillar, marked by red dotted lines 
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in a) showing a decrease in mitochondrial signal (peaks) and an increase in 
cytosolic signal (troughs) after MitoPhotoDNP activation. c and d take 
advantage of the regular pattern of IBM and cytosol in the intra-fibrillar space. 
However, any FPM in the regions selected for analysis may confound these 
results. A more robust analysis that selects only the IBM is shown in e and f. 
e, Representative fast Fourier transform (FFT) power spectrum of IFM TMRM 
signal reveals a 61 + 3% decrease (n = 9 experiments) in the amplitude of IBM 
component (0.5 sm frequency) after MitoPhotoDNP activation. Inset: log 
scale full FFT power spectrum. f, Representative FFT analyses of IFM 
mitochondria reveals no change (pre/post = 1.0 + 0.03, n = 5 experiments) in 
the TMRM amplitude of IBM (0.5 jim’ frequency) after UV exposure 

when MitoPhotoDNP is not present. Inset: log scale full FFT power spectrum. 
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Extended Data Figure 8 | Effect of UV exposure on membrane potential 
without MitoPhotoDNP. a, Representative heat map image of an isolated 
mouse soleus muscle fibre loaded with 5nM TMRM. N, nucleus; V, vessel. 
b, c, Image from a thresholded to include only the TMRM signal in the 
mitochondria (b) or cytosol+nuclei (c). d-f, Whole cell (d), mitochondrial (e) 
and cytosolic+nuclear (f) TMRM signals immediately after UV exposure in 
the centre region of the cell (outlined by the white dotted lines). g-i, Post/pre 
ratiometric whole-cell (g), mitochondrial (h), and cytosolic+ nuclear (i) 
TMRM images showing little change outside of the UV-exposed area. 
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j, Greyscale image of a highlighting the different cell regions used for 
quantitative analysis. All pixels were analysed. k, TMRM signal decreased in 
both the mitochondria and cytosol in the UV-exposed region indicative of 
slight photobleaching. Images are representative of 10 fibres from 3 mice. Data 
represent the mean = s.e. from 10 experiments from 3 animals. Asterisk 
indicates significantly different (ANOVA, P< 0.05) from a post/pre ratio of 
1.0. Plus symbol indicates significantly different (P < 0.05) from IFM+UV 
region. See Supplementary Video 10 for time course video. 
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Extended Data Figure 9 | UV light was restricted to the drawn ROI. that the UV light was largely maintained to the ROI drawn (black line). Scale 
Difference XY image of a uniformly blue fluorescent slide before and after bars, 5 um. Images are representative of 3 experiments. 
exposure to UV light in the drawn ROI (white outline). Inset: YZ view shows 
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Extended Data Table 1 | Skeletal muscle mitochondrial cross-sectional diameters 
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FIB-SEM, focused ion beam scanning electron microscopy; STED, stimulated emission depletion microscopy; PVM, paravascular mitochondria; IBM, I-band mitochondria; FPM, fibre parallel mitochondria; 
CFCM, cross fibre connection mitochondria. Statistical test was analysis of variance (P < 0.05). There were no significant differences between FIB-SEM and STED. Diameters were measured at the maximum cross- 
section of the short axis of the mitochondria which were roughly ellipsoidal or cylindrical in shape. STED diameters were determined by the full-width half-maximum (FWHM) of the linear fluorescence intensity 
profile, and FIB-SEM diameters were determined as the distance of a line drawn across a mitochondrion. 
* Denotes significantly different from PVM. 
‘Denotes significantly different from IBM. 
# Denotes significantly different from FPM. 


©2015 Macmillan Publishers Limited. All rights reserved 


Mae A A lea 


doi:10.1038/nature14482 


Molecular basis for 5-carboxycytosine recognition 
by RNA polymerase II elongation complex 


Lanfeng Wang'*, Yu Zhou*, Liang Xu'*, Rui Xiao”*, Xingyu Lu’, Liang Chen’, Jenny Chong’, Hairi Li*, Chuan He’, 


Xiang-Dong Fu? & Dong Wang! 


DNA methylation at selective cytosine residues (5-methylcytosine 
(5mC)) and their removal by TET-mediated DNA demethylation 
are critical for setting up pluripotent states in early embryonic 
development'”. TET enzymes successively convert 5mC to 
5-hydroxymethylcytosine (ShmC), 5-formylcytosine (5fC), and 
5-carboxylcytosine (5caC), with 5fC and 5caC subject to removal 
by thymine DNA glycosylase (TDG) in conjunction with base 
excision repair’ °. Early reports indicate that 5fC and 5caC could 
be stably detected on enhancers, promoters and gene bodies, with 
distinct effects on gene expression, but the mechanisms have 
remained elusive”*. Here we determined the X-ray crystal structure 
of yeast elongating RNA polymerase II (Pol II) in complex with a 
DNA template containing oxidized 5mCs, revealing specific 
hydrogen bonds between the 5-carboxyl group of 5caC and the 
conserved epi-DNA recognition loop in the polymerase. This 
causes a positional shift for incoming nucleoside 5’-triphosphate 
(NTP), thus compromising nucleotide addition. To test the 
implication of this structural insight in vivo, we determined the 
global effect of increased 5fC/5caC levels on transcription, finding 
that such DNA modifications indeed retarded Pol II elongation on 
gene bodies. These results demonstrate the functional impact of 
oxidized 5mCs on gene expression and suggest a novel role for 
Pol II as a specific and direct epigenetic sensor during transcrip- 
tion elongation. 

Epigenetic DNA methylation (S5mC) is an important regulator of 
gene transcription recognized by several families of protein readers, 
such as methyl-CpG-binding domain proteins (MBDs) and ubiquitin- 
like PHD and RING finger domain-containing proteins (for example, 
UHRF1), and certain zinc-finger proteins (kaiso; also known as 
ZBTB33)"°. TET enzymes iteratively oxidize 5mC to 5hmC, 5fC, 
and 5caC**, and TDG coupled with base excision repair further pro- 
cess 5fC/5caC to complete DNA demethylation (Fig. 1a)°*. An open 
question is whether 5fC and 5caC are simple DNA demethylation 
intermediates or have active roles in gene expression. 

Genomic mapping revealed specific enrichment of 5fC and 5caC 
at enhancers, promoters and gene bodies”*. Moreover, a number of 
protein complexes involved in transcription, splicing, chromatin 
remodelling and DNA repair have been identified to selectively 
bind synthetic DNA oligonucleotides containing oxidized 5mCs 
(oxi-mCs)'*"’. A previous study by our group indicates that these 
modifications induce transient pausing of purified yeast and mam- 
malian RNA polymerase II elongation complex (Pol II EC) in vitro™. 
Together, these observations imply that different oxi-mCs may influ- 
ence gene expression’*. Importantly, our work suggests that Pol II has 
the capacity to directly sense the demethylation state (oxi-mCs) of 
template DNA during transcription. 

To understand the molecular basis underlying the Pol II EC recog- 
nition of oxi-mCs, we performed structural studies of the complex 


assembled on an RNA/DNA scaffold that contains a 5caC at the 
i+1 site (Fig. 1b) to mimic the stage when Pol II EC encounters 
5caC during transcription elongation. This scaffold recapitulated the 
impediment of Pol II elongation in the in vitro reactions (Fig. 1c)'*. The 
crystal structure (EC-I) revealed that the upstream RNA/DNA hybrid 
region maintains a post-translocation state register in which the active 
site is empty and ready for NTP loading (Fig. 1d). About 50% of 5caC 
nucleobase (yellow coloured in Fig. 1d, h, see also Extended Data 
Fig. 1a, b) accommodates at a new translocation intermediate position, 
located about halfway between the canonical i+1 and i+2 sites. The 
other 50% of 5caC nucleobase is partially inserted into the i+1 position 
(cyan coloured in Fig. 1d, g). Importantly, we detected specific hydro- 
gen bonds between the 5-carboxyl moiety of 5caC and the side chain of 
residue Q531 at a loop in the fork region of Rpb2 (the second largest 
subunit) (Fig. le, f)'°. We termed it the ‘epi-DNA recognition loop’ or 
‘fork loop 3’, because it recognizes the epigenetic DNA modification in 
the major groove and is next to the previously identified fork loop 1 
and fork loop 2 within the fork region’’. The specific hydrogen- 
bonding interactions with 5caC result in a 90-degree rotation of the 
side chain of Q531 of the yeast Pol II Rpb2, switching its interacting 
partner in the upstream RNA/DNA hybrid region” to the nucleobase 
of 5caC at i+1 position register (Fig. le, f). This causes 5caC to shift 
into a new translocation intermediate position right above the bridge 
helix (Fig. le, f, h), which we termed the ‘midway position’. 

To investigate the potential impact of 5caC on nucleotide incorp- 
oration, we next solved the structure of the Pol II EC with a 5caC at the 
i+1 site in the presence of a non-hydrolysable GTP analogue 
(GMPCPP) to mimic the state of GTP binding opposite 5caC (EC- 
II). We found that while 5caC forms a canonical Watson-Crick base 
pair with GMPCPP (Fig. 2a and Extended Data Fig. 1c, d), the base pair 
shifts to another translocation intermediate position, ~1.5 A away 
from its canonical position towards the downstream main channel 
(Fig. 2b, d and Extended Data Fig. 2a). The interaction between the 
epi-DNA recognition loop and 5caC probably causes this positional 
shift (Fig. 2b-d), which disrupts the proper alignment between Rpb1 
L1081 and the substrate, as well as the correct positioning of the 3’- 
RNA terminus and the substrate that is crucial for full closure of the 
trigger loop and effective GTP addition'”’*. The nucleobase of the 
substrate now misaligns with Rpbl T831 in the bridge helix 
(Fig. 2a), leading to a partially open conformation of the trigger loop 
(Extended Data Fig. 2b). 

To determine further whether the specific hydrogen-bonding inter- 
action between the Pol II epi-DNA recognition loop (Rpb2 Q531) and 
5caC (Fig. 2b, c) causes a reduction in GTP addition efficiency, we 
purified two yeast Pol II point mutants (Rpb2 Q531H and Q531A) and 
measured GTP incorporation on the 5caC template in comparison 
with wild-type Pol II. The Pol II Q531A mutation abolishes the 
specific hydrogen bonds between the side chain of residue 531 and 
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Figure 1 | Pol II directly recognizes 5caC during transcription. a, Epigenetic 
modification cycle of cytosine. b, The RNA/DNA scaffold used in both 
structural and biochemical analysis. C* indicates 5caC residue. c, Impeded Pol 
II elongation on the 5caC-containing template relative to the unmodified C 
template. Time points are 0, 5 s, 15 s, 30s, 1 min, 5 min, 20 min, and 1 h (left to 
right). nt, nucleotides. d, The overall Pol II EC structure containing a site- 
specific 5caC (EC-I). Colour-coded are template DNA (blue), non-template 
DNA (green) and RNA (red). The two 5caC conformers are highlighted in 
yellow and cyan, respectively. Part of the bridge helix (BH; Rpb1 822-840) is 
highlighted in green and the rest of the Pol II subunits are in grey (Rpb2 is 


the 5-carboxyl moiety of 5caC, thus alleviating the negative impact of 
5caC on Pol II transcription. In contrast, the Q531H mutant should 
behave similarly to wild-type Pol II, as the His residue has the capabil- 
ity to form a hydrogen bond with 5caC. Indeed, we observed that the 
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Figure 2 | Interaction between 5caC and epi-DNA recognition loop 
compromises GTP incorporation. a, The Pol II EC structure containing a 
matched GMPCPP opposite the 5caC site (EC-II). The colour codes are the 
same as Fig. 1 except for 5caC (yellow) and GMPCPP (orange). b-d, The 
GMPCPP:5caC base pair is shifted towards the downstream main channel from 
the canonical GMPCPP:dC position (PDB accession 2E2J). The side chain of 
Rpb2 Q531 rotates 100° to interact with 5caC (b, c). e-g, Comparison of 
catalytic rate constants (k,1) (e), substrate dissociation constants Kg app (f) and 
specificity constants (kjoi/Kaapp) (g) of GTP incorporation opposite the 
5caC template by wild-type (WT), Q531H and Q531A Pol II, respectively. The 
mean values are presented and error bars are standard deviations derived 
from three independent experiments. 
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omitted). The addition site is represented by a dotted oval. e, The midway 
5caC interacts with the Rpb2 Q531 residue via hydrogen bonds (black dotted 
lines). The epi-DNA recognition loop (fork loop 3; Rpb2 521-541) is shown 
in cyan. f, The Q531 side chain rotates 90° to form hydrogen bonds with 5caC. 
Pol II EC-I is superimposed with the Pol II EC containing an unmodified 
DNA template in the post-translocation state (PDB accession 1SFO). The 
fork loop 3 region of Pol II EC (PDB accession 1SFO) is shown in orange. 

g, h, Comparison of two 5caC conformers (cyan or yellow) with the 
corresponding canonical template nucleotide (blue-white). 


Q531A mutant leads to a 2.6-fold increase in GTP incorporation 
specificity (catalytic rate constant (k,,1)/apparent dissociation constant 
(Kaapp)) (P value <0.0001, unpaired, two-sided t-test) with a faster k,o1 
and a tighter Kg,app, whereas the Q531H mutant behaves like wild-type 
Pol II (Fig. 2e-g and Extended Data Fig. 3). Consistently, abolishing the 
specific hydrogen-bonding interaction with the Pol II epi-DNA recog- 
nition loop by removal of the 5-carboxyl moiety of 5caC (replacement 
of 5caC to unmodified C template) also leads to a 4.2-fold increase in 
GTP incorporation specificity’*. These data demonstrate the functional 
impact of 5caC on both the rate and specificity of GTP incorporation 
via Q531 in the Pol II epi-DNA recognition loop. 

The same epi-DNA recognition loop is equally capable of forming 
similar interactions with the 5-carbonyl group of 5fC, but not with 
5mC or unmodified C (Extended Data Fig. 4). We also modelled the 
structure of 5hmC (Protein Data Bank (PDB) accession 4R2C) within 
the Pol II EC. Interestingly, the 5-hydroxyl group appears to adopt a 
different orientation away from the epi-DNA recognition loop 
(Extended Data Fig. 4c), consistent with less Pol II retardation on a 
5hmC template relative to a 5fC/5caC template™. Intriguingly, a recent 
study revealed that base J (-p-glucosyl-hydroxymethyluracil), a 
major groove DNA modification, prevents transcriptional read- 
through in Leishmania’. We observed a striking similarity between 
5fC/5caC and base J on slowing down or stalling Pol II transcription, 
suggesting a likely universal mechanism for Pol II to sense distinct 
DNA modifications via the epi-DNA recognition loop”. 

The critical Gln residue (Q531 in yeast Rpb2) is conserved among 
several fungal species containing active TET/JBP enzymes and oxi- 
mCs, such as Agaricomycetes and Pucciniomycete, and is substituted 
by the functionally equivalent His residue in mammals (Extended Data 
Fig. 5a)*°, suggesting that similar interactions between 5caC and the 
Pol II epi-DNA recognition loop are probably maintained from fungi 
to humans (Extended Data Fig. 5b-d). Indeed, we observed impeded 
elongation by human Pol II in HeLa nuclear extracts (Extended Data 
Fig. 6) and with purified rat Pol II on the 5caC-containing template 
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relative to the unmodified C template’*. The conservation of this 
critical Gln/His residue in eukaryotes coincides with the existence 
of 5fC/5caC modifications. In contrast, bacteria and archaea RNA 
polymerases carry Ala or Pro at the corresponding position in the 
Bp loop II region (Extended Data Fig. 5a)*', and consistently, we found 
that 5caC has no observable effect on Escherichia coli RNA polymerase 
transcription elongation in vitro (Extended Data Fig. 7, bottom). It is 
interesting to note that glycosylated cytosine derivatives are also pre- 
sent in some phage and bacterial DNA genomes, pointing to future 
investigations to understand how these modifications (bulkier than 
5fC/5caC) may be recognized during transcription. 

Our structural studies also shed light on the canonical Pol II trans- 
location process by revealing two new translocation intermediate posi- 
tions of the 5caC template before and after GTP binding. The first 
translocation intermediate position of 5caC sits above the bridge helix 
in the absence of GTP. Upon GTP binding, the 5caC template is shifted 
to a new translocation intermediate position allowing the formation of 
a base pair with incoming GTP. The translocation intermediate states 
are similar to the translocation intermediate states on an unmodified 
DNA template recently suggested by molecular simulation”. Our abil- 
ity to capture the crystal structures of these Pol II translocation inter- 
mediates suggests that the specific interactions between the Pol II 
recognition loop (Q531) and the 5-carboxyl group of 5caC stabilize 
the translocation intermediates that are otherwise too transient to be 
captured on unmodified DNA template. 

Aligning the structures of 5caC-paused Pol II EC with bulky DNA 
lesion-arrested or %-amanitin-arrested Pol II EC’*”** reveals addi- 
tional insights into Pol II pausing and arrest. Notably, 5caC, CPD, 
pyriplatin-dG and the i+1 transition template base in o-amanitin- 
arrested Pol II EC are all accommodated above the bridge helix 
(Fig. 3a—-c), even though their exact locations, orientations and inter- 
actions with Pol II greatly differ (see Methods). A similar (but not 
identical) ‘above-the-bridge-helix’ translocation intermediate has also 
been recently observed in an elemental paused E. coli RNA polymerase 
structure (ePEC) with a kinked bridge helix to occlude the canonical 
i+1 template position’®. Taken together, we propose that these obser- 
vations point to a common translocation checkpoint that serves as a 
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Figure 3 | Similar ‘above-the-bridge-helix’ translocation intermediates 
captured in pausing/arrested Pol II ECs and a common 5caC-recognition 
mode shared bya variety of 5caC-recognition proteins. a—c, Superimposition 
of 5caC-paused Pol II EC with CPD-lesion-arrested EC (PDB accession 4A93) (a), 
pyriplatin-lesion-arrested EC (PDB accession 3M40) (b), and «-amanitin- 
arrested EC (PDB accession 2VUM) (c). The similar above-the-bridge-helix 
translocation intermediates region for accommodation of the i+1 5caC 
(yellow) and DNA lesion (or translocation intermediate captured by 
.-amanitin) (blue-white) is highlighted by a red-dotted oval. The damage- 
arrested or &-amanitin-arrested Pol II ECs are shown in grey. BH, bridge helix. 
d-f, The conserved interactions and residues involved in 5caC recognition by 
Pol II (Rpb2 Q531) (d), human WT1 (Q369; PDB accession 4R2R) (e), and 
human TDG (N157; PDB accession 3U07) (f). 
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rate-limiting step for the transition of the DNA template nucleobase to 
cross over the bridge helix and subsequently insert into the canonical 
i+1 site to guide RNA synthesis. While DNA lesions have been pro- 
posed to interfere with Pol II elongation via steric hindrance**”’, our 
current data suggest that Pol II can also directly sense epigenetically 
modified DNA (5caC/5fC) through specific hydrogen-bonding 
interactions. 

We further noticed a remarkable mechanistic similarity in 5caC 
recognition by several unrelated family proteins. For example, residue 
Q369 in human Wilms tumor protein 1 (WT1) (PDB accession 
4R2R)'? and residue N157 in human TDG (PDB accession 3U07)* 
are both functionally equivalent to Q/H531 residues in Pol II in 
recognizing the 5caC carboxyl group via specific hydrogen bonds 
(Fig. 3d-f). We thus speculate that 5caC could be a potential epi- 
genetic mark for recognition by a variety of ‘protein readers’ (including 
Pol II itself) via specific hydrogen-bonding interactions with its 
5-carboxyl moiety. 

To determine further the functional consequences of oxi-mCs on 
Pol II transcription elongation in mammalian cells, we measured the in 
vivo transcription elongation rate on a pair of isogenic mouse embry- 
onic stem (ES) cells (Tdg™" wild-type mouse ES cells and Tdg ’~ 
knockout mouse ES cells derived from conditional TDG-knockout 
mice) by global nuclear run-on coupled with deep sequencing 
(GRO-seq) (Fig. 4a). Previous studies showed that, relative to wild 
type, TDG knockout led to a substantial increase of global 5£C/5caC 
levels”*®. The GRO-seq experiments allowed us to measure the front 
edge of waves of nascent transcripts at different time points to deduce 
the rate of Pol II transcription elongation. 

We observed retarded Pol II elongation in TDG-knockout ES cells 
relative to wild-type ES cells after the functional impact was sufficiently 
accumulated, as exemplified on the long Myole gene (Fig. 4b). Further 
metagene analysis of the middle points of wild-type and TDG-knock- 
out mouse ES cells at different time points revealed a clear reduction of 
Pol II elongation in TDG-knockout relative to wild-type ES cells after 
30 min of synchronized transcription, although the differences at earl- 
ier times were not evident (Fig. 4c). We next analysed the GRO-seq 
read density gene by gene +10 kb around individual middle points 
followed by linear regression to determine the slope (Fig. 4d). We 
observed progressive slowing down of Pol II in TDG-knockout ES 
cells relative to wild-type cells, as indicated by decreasing slopes, and 
the read density ratio (TDG knockout/wild type) at 30 min was sig- 
nificantly smaller relative to control (P value = 1.52 X 10°’ from 
one-sided Kolmogorov—Smirnov test) (Fig. 4d). Finally, to determine 
the dosage-dependent effect of 5fC/5caC on Pol II transcription 
elongation, we focused on the data at 30 min and segregated genes 
into two groups according to increased levels of 5fC/5caC in response 
to TDG knockout and compared the middle points at individual assay 
points. The data indicate a correlation between increased 5fC/5caC 
and a reduced transcription elongation rate among genes in group 2 
(high 5£C/5caC level) relative to group 1 (low 5£C/5caC level) (Fig. 4e). 
Together, these global data demonstrate retarded Pol II elongation 
by enhanced 5fC/5caC levels in the gene body. The combination of 
in vitro and in vivo data strongly indicates a direct impact of 5£C/5caC 
on Pol II elongation on the DNA template. 

We present structural and biochemical evidence to suggest that 
Pol II has the ability to sense the DNA oxidative methylation state 
directly through its conserved epi-DNA recognition loop, and that it 
transiently slows down at oxi-mC (5fC/5caC) sites during transcrip- 
tion. Since 5fC/5caC are not distributed evenly across the genome and 
show considerable variation between cell types, it is conceivable that 
these pausing effects may add a new layer of fine-tuned regulation of 
Pol II transcription elongation dynamics. For example, compared with 
transcription of short genes, the cumulative consequences of pausing 
effects at 5fC/5caC sites probably have much more profound regula- 
tory impacts on transcribing some long genes that are preferentially 
expressed in the brain and have crucial roles in neuronal integrity”. 
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Figure 4 | Impact of 5fC/5caC on Pol II transcription elongation in mouse 
ES cells. a, Scheme of the DRB releasing assay. Wild-type (WT) and TDG- 
knockout (KO) mouse ES cells (mESC) were treated with DRB followed by 
washing out DRB to allow transcription for 10, 20 or 30 min (10M, 20M and 
30M, respectively). No DRB treatment (NODRB) or 3 h DRB treatment 
(DRB3H) were performed as controls. All experiments were performed in 
duplicate and reproducibility was evident in all pairwise comparisons 
(Extended Data Fig. 8). b, The GRO-seq data on the representative Myole gene. 
Elevated 5fC/5caC levels in TDG-knockout mouse ES cells are derived from 
published chromatin immunoprecipitation followed by sequencing (ChIP-seq) 
data in duplicate*. Chr, chromosome. c, Comparative metagene analysis of 
GRO-seq signals between wild-type (top) and knockout mouse ES cells 
(bottom). Dashed and non-dashed lines show the middle points of the 
ensemble transcription waves in wild-type and knockout mouse ES cells, 


The transient Pol II pausing at 5fC/5caC sites may also provide signals 
for the recruitment of various transcription elongation factors, chro- 
matin remodelling complexes, messenger RNA processing machiner- 
ies, and TDG and the base excision repair machineries to the oxi-mC 
sites to induce additional functional consequences. On the basis of the 
similarity between direct Pol II recognition of 5caC and the role of 
Pol II in sensing bulky DNA lesions in transcription-coupled nucleo- 
tide excision repair, we propose that Pol II may act as a direct sensor for 
a variety of DNA modification and damage events to instruct distinct 
downstream pathways**?””°. 

Online Content Methods, along with any additional Extended Data display items 


and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


Preparation of Pol II ECs. Saccharomyces cerevisiae Pol II was purified as prev- 
iously described’’. PAGE-purified RNA oligonucleotides were purchased from 
Dharmacon, non-template DNA oligonucleotides were obtained from IDT, and 
template DNA oligonucleotides with 5caC were prepared and purified as prev- 
iously described’*. The template DNA, non-template DNA and RNA oligonucleo- 
tides were annealed to form the scaffold. To form the Pol II EC, Pol II was mixed 
with scaffold in the reaction buffer (20 mM Tris (pH 7.5), 40 mM KCl and 5 mM 
dithiothreitol (DTT)). The final concentrations were 2 1M Pol II, 10 .M template 
DNA and 20 LM non-template DNA and RNA oligonucleotides. The mixture was 
incubated at room temperature for 1 h, followed by ultrafiltration to remove excess 
oligonucleotides. The Pol II elongation complex was crystallized using the hanging 
drop method and from solutions containing 390 mM (NH4)2HPO,/NaH2PO,, pH 
6.0, 50 mM dioxane, 10 mM DTT and 10.7-11.6% (w/v) PEG6000. Crystals were 
transferred in a stepwise manner to cryo buffer as previously described’’. For the 
Pol II EC with GMPCPP, Pol II EC crystals were soaked with 5-10 mM GMPCPP 
and 10 mM MgCl, overnight before harvest. 

Data collection and structure determination of 5caC-paused Pol II ECs. 
Diffraction data were collected on beamlines 8.2.1 and 5.0.2 at the Advanced 
Light Source, Lawrence Berkeley National Laboratory. Data were processed in 
DENZO and SCALEPACK (HKL2000)*’. Model building was performed with 
the program Coot”, and refinement was done with REFMACS5 with TLS 
(CCP4i) or PHENIX (Extended Data Table 1)*’. Electron density maps are shown 
in Extended Data Fig. 1. EC-I refers to the Pol II EC crystal structure that contains 
a site-specific 5caC at the i+1 site in the absence of GTP binding. EC-II refers to 
the Pol II EC crystal structure that contains a site-specific 5caC at the i+ 1 site in the 
presence of GMPCPP. Ramachandran plot of EC-I showed 85.57%, 11.70% and 
2.73% of EC-I residues are in preferred, allowed and disallowed regions, respect- 
ively. For EC-II, 86.00%, 11.22%, and 2.78% of residues are in above regions, 
respectively. All structural models in the figures were superimposed with the bridge 
helix region (Rpb1 822-840) near the active site using Coot” and PYMOL™. 

Pol II purification for in vitro transcription assay. S. cerevisiae Pol II and 
mutants were purified essentially as previously described’’. Briefly, Pol II (with 
recombinant protein A tag at Rpb3 subunit) was first affinity-purified by IgG 
column. The Pol II elution from IgG column was further purified using HiTrap 
Heparin and Mono Q (GE Healthcare). The final pure Pol II (Extended Data 
Fig. 3d) was ready for future in vitro transcription experiments. 

In vitro transcription assays. The S. cerevisiae Pol II ECs for transcription assays 
were assembled using established methods". Briefly, an aliquot of 5’-**P-labelled 
RNA was annealed with a 1.5-fold amount of template DNA and a 2-fold amount 
of non-template DNA to form RNA/DNA scaffold in elongation buffer (20 mM 
Tris-HCl, pH 7.5, 40 mM KCl and 5 mM MgCl). An aliquot of annealed scaffold 
of RNA/DNA was then incubated with a fourfold excess amount of Pol II at room 
temperature for 10 min to ensure the formation of Pol II EC. The in vitro tran- 
scription started when the Pol II EC was mixed with equal volumes of GTP 
solution. The final concentrations were 25 nM scaffold, 100 nM Pol II and 
1M GTP in the elongation buffer. Reactions were quenched at various time 
points by the addition of one volume of 0.5 M EDTA (pH 8.0). (Time points 
are 0, 5s, 15 s, 30s, 1 min, 5 min, 20 min, and 1 h). The quenched products were 
analysed by denaturing PAGE and visualized using a storage phosphor screen and 
Pharos FX imager (Bio-Rad). The in vitro transcription assay of E. coli RNA 
polymerase (RNAP, New England Biolabs (NEB)) was performed using the same 
procedure as S. cerevisiae RNA Pol II transcription. 

For the transcription of human Pol II in the nuclear extract of HeLa cells (Life 

Technologies), the excess annealed scaffold was incubated with nuclear extract of 
HeLa cells for 5 min before the addition of «-**P-GTP. The final concentrations 
were 1 tM scaffold, 1 1M «-32P-GTP (0.2 Ci pl”) and 3 mg ml’ protein of 
nuclear extract. Reactions were then quenched at various time points by the 
addition of one volume of 0.5 M EDTA (pH 8.0). The quenched products were 
analysed by denaturing PAGE and visualized using a storage phosphor screen and 
Pharos FX imager (Bio-Rad). All transcription assays described earlier were per- 
formed independently in triplicates. 
In vitro RNA pol II transcription kinetic assay and analysis. The assay was 
carried out as previously described"’. Briefly, nucleotide incorporation assays were 
conducted by pre-incubating 50 nM annealed scaffold containing a site-specific 
5caC modification at the template with 200 nM purified Pol II (wild type, Q531H 
and Q531A) for 10 min in elongation buffer at room temperature. The Pol II EC 
was then mixed with an equal volume of solution containing 40 mM KCl, 20 mM 
Tris-HCl (pH 7.5), 10 mM DTT, 10 mM MgCl, and twofold concentrations of 
various nucleotides. Final reaction concentrations after mixing were 25 nM scaf- 
fold, 100 nM Pol I, 5 mM MgCl, and various nucleotide concentrations in 
elongation buffer. Reactions were quenched at various times by addition of one 
volume of 0.5 M EDTA (pH 8.0) and analysed by denatured PAGE. 


Nonlinear-regression data fitting was performed using Prism 6. The time 

dependence of product formation was fit to a one-phase association equation 
(product = Ae fs © + ©) to determine the observed rate (Kops). The substrate 
concentration dependence was fit to a hyperbolic equation (ko; = kpo1 [substrate]/ 
(Kapp + [substrate])) to obtain values for the maximum rate of NTP incorpora- 
tion (kpo1) and apparent Ky (Kapp) governing NTP binding essentially as 
described. The specificity constant was determined by kyoi/Ka,app- 
Cell culture and in vivo transcription rate measurement. Wild-type mouse ES 
cells (Tdg” uf ) and TDG-knockout mouse ES cells (Tdg ‘ ~) were cultured in 
KnockoutTM DMEM (Life Technologies, catalogue no. 10828-018) supplemen- 
ted with 15% KnockoutTM Serum Replacement (Life Technologies, catalogue no. 
10828-028), 2 mM L-glutamine (Life Technologies, catalogue no. 25030-081), 1x 
non-essential amino acids (Life Technologies, catalogue no. 11140-050), 1x 
penicillin-streptomycin (Life Technologies, catalogue no. 15140-122), 0.1 mM 
2-mercaptoethanol (Life Technologies, catalogue no. 21985-023), 1,000 U ml"? 
LIF (Millipore, catalogue no. ESG1106), 3 uM CHIR99021 (Stemgent, catalogue 
no. 04-0004) and 1 uM PD0325901 (Stemgent, catalogue no. 04-0006). The DRB 
releasing GRO-seq assays were carried out in mouse ES cells under both wild- 
type and TDG-knockout conditions. For each time-course assay, there are five 
samples prepared for GRO-seq: (1) NODRB (without DRB treatment); (2) 
DRB3H (DRB treatment for 3 h, and this is the 0 time point); (3) 10M (10 
min after washing out DRB); (4) 20M (20 min after washing); (5) 30M (30 min 
after washing). For DRB treatment, we grew cells in a 10 cm plate to 70-80% 
confluence, treated cells by addition of DRB (Sigma) at a final concentration of 
100 mM to the culture medium and incubated for 3 h in the incubator, removed 
DRB by quick washing cells three times with PBS, then incubated in fresh 
medium in the incubator to different time points. GRO-seq was implemented 
as previously described***, and the GRO-seq libraries were subjected to 
Illumina Hiseq 2000 and 2500 for sequencing. 

For each sequencing sample, the sequenced reads were trimmed by removing 3’ 
adaptor and polyA sequences, and only those longer than 16 bp were used to map 
the mouse genome (mm9) with Bowtie (version 0.12.7), with parameters “-best- 
strata -125 -n2 -k1 -m10”*’. Only one read was kept for those reads mapped to the 
same location and strand. 

To measure the concordance between replicated samples, we counted the num- 
ber of the GRO-seq reads in all annotated genes (UCSC refGene) and did pairwise 
comparisons”. Transcripts with the same start and end positions were used once. 
Having established the data reproducibility, we combined replicated data sets for 
comparison between biological conditions at different assay points. 

To estimate the Pol II elongation rates, we computed the metagene profiles for 
all assay points. Only the genes with RPKM = 0.5 in the NODRB sample were kept 
for meta-analysis. The genes were aligned at TSSs, and mapped reads were counted 
in 100 bp bins across the gene bodies. The counts were normalized to one million 
total reads per sample, and were averaged for each bin by the number of covering 
genes and normalized by the relative gene expression in the NODRB sample. The 
meta-profiles from the normalized counts were smoothed with a 1 kb moving 
window. The middle point of the ensemble transcription wave at each time point 
after washing DRB was computed as the position at which the signal reached half 
of that in the NODRB control sample. 

To compare elongation differences at different assay points on individual genes, 
we calculated the GRO-seq read density in +10 kb window around the middle 
points identified in wild-type mouse ES cell cells. At each assay point (10M, 20M, 
30M), the counts for each gene in wild-type and TDG-knockout conditions were 
pairwise compared and linear regressions were fitted to check the trend of change. 
The samples without DRB treatment were used as control. The changes of 5fC/ 
5caC levels on genes were calculated as the differences of normalized ChIP-seq 
signals under 5fC/5caC ChIP-seq peaks from knockout to wild type based on the 
published ChIP-seq data from ref. 8, and the genes were divided into two groups 
with low and high increased 5fC/5caC levels. 

Comparison of 5caC-paused, DNA lesion-arrested, and a-amanitin-arrested 
Pol II EC. All the structures were aligned by superimposition of the Pol II bridge 
helix region (residues 822-840 in Rpb1). The i+ 1 5caC, CPD, pyriplatin-dG and 
i+1 transition template base in «-amanitin-arrested Pol II EC are all accommo- 
dated above the bridge helix, even though their exact locations, orientations and 
interactions with Pol II greatly differ: x-amanitin appears to capture the Pol II 
translocation intermediate indirectly by jamming the movement of the Pol II 
bridge helix and trapping the trigger loop in an inactive conformation, whereas 
the conformation of CPD and pyriplatin-DNA lesions is largely governed by their 
covalent crosslink or bulky ligand. In contrast to all of these previous cases, the 
translocation intermediate of 5caC nucleobase forms a direct interaction with the 
Pol II epi-DNA recognition loop. Second, the upstream RNA/DNA hybrid adopts 
essentially the same post-translocation register for all of these paused/arrested Pol 
II ECs, except that the 3’-RNA/DNA hybrid is substantially tilted for CPD-lesion- 
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arrested Pol II EC. Finally, while 5caC and pyriplatin-DNA lesion can form 
Watson-Crick base pairs with incoming NTP, thus allowing template-dependent 
nucleotide addition, the CPD-DNA lesion fails to form such a base pair with the 
incoming nucleotide, therefore only allowing template-independent ATP incorp- 
oration. In contrast to all of these previous cases, the translocation intermediate of 
the 5caC nucleobase forms a direct interaction with the Pol II epi-DNA recog- 
nition loop. 
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Extended Data Figure 1 | Electron density maps of Pol II EC-I and EC-II. 
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Extended Data Figure 2 | Structural comparison between Pol II EC-I, EC-II _ b, Superposition of Pol II EC-II containing 5caC template and GMPCPP with 
and Pol II EC containing unmodified C template and a matched GTP. Pol II EC with closed trigger loop (TL; containing unmodified C template 

a, Superimposition of Pol II EC-I and EC-II structures. Rpb2 Q531 and5caCin and GTP; PDB accession 2E2H). The two structures are aligned using the 
EC-II are in magenta to differentiate between those counterparts in EC-I. These _ bridge helix region (Rpb1 822-840). 

two structures are aligned using the bridge helix (BH) region (Rpb1 822-840). 
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Extended Data Figure 3 | Kinetic study of GTP incorporation opposite incorporation opposite 5caC template for Pol II wild type (WT; a), Pol II 
5caC template by purified Pol II proteins. a—c, Representative kinetic Q531H (b) and Pol II Q531A (c). d, Purified Pol II wild-type, Pol II Q531H and 
parameter fitting curves from three independent experiments for GTP Pol II Q531A proteins used in the in vitro transcription experiments. 
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and GMPCPP in EC-II. b, Model of the interaction between Pol ITEC with 5f{C unmodified C template. The above models were derived from the Pol II EC-II 
template through the same hydrogen-bond interaction network. c, Model of __ structure. 
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Extended Data Figure 5 | Sequence alignment of Pol II epi-DNA the green box. b, Hydrogen bonds (black dotted lines) between yeast Pol II 
recognition loop across different species. a, Pol II epi-DNA recognitionloop  Rpb2 Q531, 5caC and GMPCPP in EC-IL. c, Model of human Pol II with the 
(Rpb2 521-541) is conserved from fungi to human and strictly conserved functionally equivalent His substitution based on EC-II structure. 


among several fungal species, highlighted with magenta dotted rectangle, which d, Comparison between Q531 and H531 substitution reveals the similar 
contain active TET/JBP enzymes". Key residues in the loop are highlighted in hydrogen-bonding interaction. 
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Relative Rate 


LETTER 


Extended Data Figure 6 | Human Pol II slows down at 5caC template in 
comparison with unmodified template in the context of HeLa nuclear 
extract. The relative transcription elongation rate is normalized by the 
transcription elongation rate (k,,,) from unmodified template. The relative 
rates from unmodified template and 5caC template are coloured in black and 
grey, respectively. The error bars are standard deviations derived from three 


independent experiments. 
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Extended Data Figure 7 | Comparison transcription on 5caC template with are 0, 5s, 15s, 30s, 1 min, 5 min, 20 min, and 1 h (left to right). The top panel is 
unmodified template using purified yeast Pol II and E. coli RNAP. Top, 


identical to Fig. 1c and is placed here for direct comparison. nt, nucleotides. 
comparison of yeast Pol II; bottom, comparison of E. coli RNAP. Time points 
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Extended Data Figure 8 | Correlation between two replicates of GRO-seq density of points or genes. The Pearson correlation coefficients were calculated 
data sets at different assay points. GRO-seq replicates (—1 and —2) were from the points and are shown on the top of each subfigure. rpm, reads per 
pairwise compared gene by gene on the normalized number of reads for wild- _ million total reads. 

type (WT; left) and TDG-knockout (KO; right) samples. The colours show the 
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Extended Data Table 1 | Data collection and refinement statistics 
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EC-Il 


Data collection 
Space group 
Cell dimensions 

a, b, c (A) 

a, B,y (°) 
Resolution (A) 
Rais 
Isl 
Completeness (%) 


Redundancy 


Refinement 

Resolution (A) 

No. reflections 

Rwor/ Rice 

No. atoms 
Protein/Nucleic acid 
Ligand/lons 
Water 

B-factors 
Protein/Nucleic acid 
Ligand/lons 
Water 

R.m.s deviations 
Bond lengths (A) 
Bond angles (°) 


*Values in parentheses are for the highest-resolution shell. 


C2 


166.7, 221.6, 192.4 
90, 100.4, 90 
50-3.5 (3.56-3.5) * 
0.143 (0.583) 
8.1 (1.7) 

94.3 (72.8) 

3.6 (3.3) 


49.3-3.5 
81,638 
20.1/23.2 


29180 


0.009 
1.355 
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C2 


168.2, 222.6, 192.8 
90, 101.6, 90 
50-3.3 (3.36-3.3) 
0.153 (0.762) 
9.2 (1.1) 

99.4 (96.5) 

3.7 (3.3) 


48.9-3.3 
105636 
20.7 /25 


29151 
42 


102.8 
95.4 


0.009 
1.326 


CORRECTIONS & AMENDMENTS 


CORRIGENDUM 
doi:10.1038/nature14607 


Corrigendum: Divergent 
reprogramming routes lead to 
alternative stem-cell states 


Peter D. Tonge, Andrew J. Corso, Claudio Monetti, 

Samer M. I. Hussein, Mira C. Puri, Iacovos P. Michael, Mira Li, 
Dong-Sung Lee, Jessica C. Mar, Nicole Cloonan, David L. Wood, 
Maely E. Gauthier, Othmar Korn, Jennifer L. Clancy, 

Thomas Preiss, Sean M. Grimmond, Jong-Yeon Shin, 
Jeong-Sun Seo, Christine A. Wells, Ian M. Rogers & Andras Nagy 


Nature 516, 192-197 (2014); doi:10.1038/nature14047 


In this Article, the address listed as Nicole Cloonan’s present address 
(QIMR Berghofer Medical Research Institute, Queensland 4006, 
Australia) should have been listed as her other affiliation with super- 
script 16, because the work she did was split equally between the 
two institutions. This has been corrected in the online versions of 
the paper. 
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CORRECTIONS & AMENDMENTS 


RETRACTION 
doi:10.1038/nature14591 


Retraction: Integrative genomics 
identifies APOE ¢4 effectors in 


Alzheimer’s disease 


Herve Rhinn, Ryousuke Fujita, Liang Qiang, Rong Chen, 
Joseph H. Lee & Asa Abeliovich 


Nature 500, 45-50 (2013); doi:10.1038/nature12415 


In this Article, we described integrative genomics analyses of Alzheimer’s 
disease and associated risk factors. However, reanalysis of the data has 
showed that sample numbers, image panels and data points were inap- 
propriately manipulated and inaccurate in the ELISA and subcellular 
localization studies presented in Figs 2d, e, 3b, g, h and 4c, as well as in 
corresponding Supplementary Figs 10-16. We are in the process of 
repeating these cell-based studies. We remain confident in the tran- 
scriptomics and human genetics analyses reported in the Article. 
However, given these issues, we wish to retract the Article in its entirety. 
We deeply regret this circumstance and apologize to the community. 
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CORRECTIONS & AMENDMENTS 


CORRIGENDUM 
doi:10.1038/nature14606 


Corrigendum: Genome-wide 
characterization of the routes to 
pluripotency 


Samer M. I. Hussein, Mira C. Puri, Peter D. Tonge, 

Marco Benevento, Andrew J. Corso, Jennifer L. Clancy, 
Rowland Mosbergen, Mira Li, Dong-Sung Lee, Nicole Cloonan, 
David L. A.Wood, Javier Munoz, Robert Middleton, 

Othmar Korn, Hardip R. Patel, Carl A. White, Jong-Yeon Shin, 
Maely E. Gauthier, Kim-Anh Lé Cao, Jong-Il Kim, 

Jessica C. Mar, Nika Shakiba, William Ritchie, John E. J. Rasko, 
Sean M. Grimmond, Peter W. Zandstra, Christine A. Wells, 
Thomas Preiss, Jeong-Sun Seo, Albert J. R. Heck, 

Ian M. Rogers & Andras Nagy 


Nature 516, 198-206 (2014); doi:10.1038/nature14046 


In this Article, the address listed as Nicole Cloonan’s present address 
(QIMR Berghofer Medical Research Institute, Queensland 4006, 
Australia) should have been listed as her other affiliation with super- 
script 22, because the work she did was split equally between the 
two institutions. This has been corrected in the online versions of 
the paper. 
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To serve or not to serve 


When committees come knocking, scientists need to know which requests will benefit them 
and which will only steal their time — and how to tell the difference. 


BY ROBERTA KWOK 


nastasia Ailamaki fondly remembers 
A first experience serving on a 
grant-application review commit- 
tee for the US National Science Foundation 
(NSF). Through working with peers to 
evaluate and rank grant proposals asking 
for spectrometers and other instruments, 
Ailamaki, a computer scientist now at the 
Swiss Federal Institute of Technology in 
Lausanne, gained valuable insight into what 
makes an application clear and convinc- 
ing. “I adored that experience,’ she says. She 
credits it with helping her to prepare her own 
successful application for an NSF early-career- 
development grant. 
But like many researchers, Ailamaki has at 
times been overloaded with requests for her 
service. “First reaction is that I’m very flattered 


that I have been invited?’ she says. “Second is 
that I realize I really don't have time, by any pos- 
sible measure, to be on that committee. And the 
third reaction is to say yes.” She has served on 
committees of all types, including those deal- 
ing with promotions, department management, 
campus events and conference and workshop 
organization. Although many of these experi- 
ences have proved valuable, she now tries to 
consider requests more carefully before accept- 
ing them — weighing, for instance, whether she 
is uniquely qualified for the spot or whether the 
committee chair could easily find someone else. 

Committee work is tricky for scientists to 
navigate. On the one hand, it can offer many 
benefits: opportunities to network, learn about 
the state of the field, get ideas to improve 
research and influence funding decisions or 
policy. On the other hand, some research- 
ers become overburdened — they sacrifice 
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research time to sit in meetings, they draft 
recommendations that go unused or they get 
dragged into political disputes. And institu- 
tions may lack concrete guidelines for service 
requirements, making it difficult for research- 
ers to gauge whether their workload is fair. 
But careful strategizing can help scientists 
to make the most of their service. They should 
gather information about committees before 
agreeing to join, consider the work’s potential 
impact and proactively seek assignments that 
they feel passionate about. To help committees 
to run smoothly, members should actively aim 
to keep discussions on topic and treat peers 
respectfully. And as leaders, committee chairs 
should ensure that the process is efficient and 
professional (see “The ruling of the chair’). 
Junior researchers might feel obligated to 
accept every committee request. At some 
institutions, women or researchers from > 
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> under-represented minorities, in particular, 
may be recruited more often than their peers 
to increase diversity on a panel, and so might 
feel pressure to serve as a representative voice. 
But before deciding, scientists should consider 
whether the assignment is worthwhile for 
them personally. “You've got to get something 
out of it as well,” says Patricia Molina, head 
of the physiology department at Louisiana 
State University Health Sciences Center New 
Orleans. She also chairs the National Hispanic 
Science Network, a virtual organization that 
promotes research on issues important to the 
Hispanic community and fosters development 
of Hispanic scientists. 


THE POWER TO SAY NO 

It can be hard to work out which invitations 
to turn down because service requirements 
are sometimes vague and guidelines vary by 
institution. A regional university with a lim- 
ited graduate programme might expect faculty 
members to be heavily involved in university 
governance — for example, developing policies 
that are related to undergraduate education — 
whereas a research-focused university might 
value service with national and international 
professional associations. 

Researchers should ask their department 
heads, mentors or colleagues for advice on how 
to evaluate a request. Senior faculty members 
might know how much work a committee 
entails and the extent to which it will benefit 
a scientist’s career. They might also warn of 
political landmines, such as two departments 
that fight constantly over the same resources. 
For instance, a curriculum committee could 
be time-consuming because of a knotty 
battle to change entrenched teaching methods, 
says Maryrose Franko, senior science pro- 
gramme manager at the Howard Hughes 
Medical Institute’s Janelia Research Campus 
in Ashburn, Virginia. 

Scientists should also investigate the potential 
impact of the group’s work. People are often 
eager to serve on committees that advise 
federal government agencies because the invita- 
tion makes them feel 


important, says Tom “ If we don’t 

pee cn anara get involved, 

at the University o ee ° 

Colorado Boulder. rgege a lal 

But he adds that they absence of 

should ask the chair ‘nts 

about the fate of their °°'C™ ifi . 
input. 


findings. In some 
cases, the agency is 
committed to funding the recommendations, 
but in others, reports are simply circulated to 
political staffers with no guarantee that anyone 
will attempt to implement the ideas. 

For some scientists, the chance to influence 
important issues might be worth the risk of 
wasting time. In 2011-13, geophysicist Steve 
Hickman served on a committee that advised 
the US Department of the Interior on improving 
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EASE THE PAIN 


The ruling of the chair 


Institutions and committee chairs can take 
steps to make service assignments fair 
and painless for researchers. University 
departments, for instance, can be 
transparent about how much service work 
each faculty member is performing, says 
Joya Misra, a sociologist at the University 
of Massachusetts Amherst. She says that 
departments could e-mail researchers every 
year with all service assigned over the past 
decade. If some scientists realize that they 
have been doing more than their colleagues, 
they might feel less guilty about declining 
requests. At the University of Maryland in 
College Park, higher-education researcher 
KerryAnn O’Meara and her colleagues 
analysed annual faculty reports to calculate 
the average number of service activities 
performed by professors at a given rank 
and college — for example, an associate 
professor in the college of computer, 
mathematical and natural sciences — and 
published the data on an internal website. 
Faculty members use the site to compare 
their service workload to their peers’ and 
decide whether to accept assignments, says 
O'Meara. She is willing to share templates 
with other universities to show them how to 
collect and present similar data. 

If researchers bear an unusually heavy 


safety of offshore development of oil and gas. 
Hickman, who now directs the US Geological 
Survey Earthquake Science Center in Menlo 
Park, California, did not know whether the 
group's advice would be followed. “It is a 
gamble,’ he says. “But if we don't get involved, 
decisions will be made in the absence of scien- 
tific input” Their work paid off — some of the 
groups recommendations, such as setting up an 
ocean energy-safety institute, are now in place. 

Service can also pay off in networking 
opportunities. Members of a department semi- 
nar committee, for example, have a chance to 
invite speakers in their field whom they would 
like to meet. These visitors might give the 
scientist feedback on ongoing projects or write 
reference letters in the future. Serving with a 
professional association could enable gradu- 
ate students and postdocs to meet potential 
employers, and organizing a conference will 
earn a researcher name recognition in the 
field. In 2011, when Megan Carey organized an 
international neuroscience symposium at her 
institute, she became acquainted with many of 
the speakers she had invited — some of whom 
later asked her to give talks. “It was an incredi- 
ble networking opportunity for me,’ says Carey, 
a neuroscientist at the Champalimaud Centre 
for the Unknown in Lisbon, Portugal. 
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service load, institutions can compensate 
by reducing their teaching requirements, 
suggests Misra. Creating default rotations 
for time-intensive departmental roles 
can also help to distribute the work fairly 
between faculty members. 

Committee chairs should keep the 
process efficient. When ecologist Jay 
Stachowicz chaired an educational-policy 
committee at the University of California, 
Davis, he began each meeting by noting 
items that seemed uncontroversial — such 
as eliminating a course from a major 
requirement because it was no longer 
offered — and asking whether anyone 
opposed passing them. If not, he moved on. 

The chair should also ensure that 
members treat each other respectfully. 
Senior faculty members who try to bulldoze 
junior researchers may need a private 
reminder that each person’s opinion counts 
equally, regardless of his or her rank. “| want 
everybody to agree that they’re going to 
park their titles at the door,” says John Murry 
Jr, coordinator of the higher-education 
programme at the University of Arkansas in 
Fayetteville. If some people are dominating 
the discussion, the chair can intervene or 
pull quieter members aside during a break 
to encourage participation. AK. 


And some committee members forge 
personal, not just professional, connections. 
When Hickman chaired a science-advisory 
group for the International Continental 
Scientific Drilling Program, the team took 
trips to drilling sites around the world together, 
which helped to build camaraderie. “Some of 
my best friends I've made in my field have been 
on committees like this,’ he says. 


COMMITTEE PHOBIA 
For scientists who loathe committees and 
simply want to do their research, service 
assignments that benefit their immediate 
working environment may be the most palat- 
able. By participating in faculty searches, for 
instance, researchers can select colleagues 
who could positively influence their work. 
“Being able to shape your environment is 
something that’s important for all, even for the 
person who says, ‘I just want to get my science 
done,” says Jeremy Boss, an immunologist at 
the Emory University School of Medicine in 
Atlanta, Georgia. A new colleague could sug- 
gest ideas to improve research, such as studies 
to read or experimental techniques to try. 
Researchers may also volunteer for commit- 
tees that appeal to them, instead of waiting for 
requests. “The worst thing is to get assigned 
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to some random committee that you have 
no passion for,” says Cech. Once they have 
chosen committees for themselves, scientists 
can use those service obligations as reasons 
to decline less-desirable assignments. 

After committing to a group, scientists 
should execute their duties diligently — it 
is always possible that the committee chair 
will evaluate them for a promotion later. 

Ifthe committee’s goal is vague or discus- 
sions are unfocused, researchers can ask the 
chair to clarify the mission with adminis- 
trators or to provide agendas in advance. 
During meetings, members should avoid 
making comments that do not directly 
serve the committee's purpose. For instance, 
when developing policy, people often tell 
anecdotes to show why the regulation is nec- 
essary, says Boss. “All it does is waste time,” 
he says. Instead, the team should concentrate 
on the wording of the policy and ensure that 
it covers the necessary scenarios. 

Researchers outside traditional uni- 
versities may encounter a wide variety of 
expectations and styles. Scientists at the 
Janelia Research Campus have minimal 
service obligations so that they can focus on 
research, whereas those at the Wilderness 
Society, a conservation organization in 
Washington DC, are encouraged to serve on 
committees that influence policy and man- 
agement decisions. At the Champalimaud 
Centre, a small group of neuroscientists has 
been shaping the direction of the budding 
programme. Faculty members are involved 
in more types of service than are those in 
academia, and their meetings can be more 
intense and efficient. For example, they all 
participate in hiring decisions, but rather 
than interviewing candidates over several 
months, they gather for a one- or two-day 
symposium to see applicants give talks. 

Scientists should discuss committee- 
service expectations during their job-offer 
negotiations. A supervisor might even 
be able to provide precise requirements. 
Molina expects junior researchers in her 
department to spend no more than 5% of 
their time on committee work; mid-level 
researchers are expected to spend 10-15%. 

Ultimately, science cannot run without 
service. Researchers need to review each 
other's proposals, contribute to professional 
organizations and help universities to foster 
strong research and student development. 
Faculty members who avoid all committees 
risk isolating themselves from the com- 
munity or being perceived as slackers. “In 
science, people are expected to be givers 
and sharers,” says Molina. Still, that is no 
reason to feel guilty for setting boundaries. 
“T believe in participating and volunteer- 
ing, she says, “but there’s a limit.” m 


Roberta Kwok is a freelance writer in 
Seattle, Washington. 


TURNING POINT 


Heather Schneider 


For her postdoc, ecologist Heather Schneider 
joined Project Baseline, a nationwide US 
initiative that is developing a seed bank for 
future scientists to study how plants are 
evolving in response to climate change. The 
project has left her little time for her own 
research at the University of California, Santa 
Barbara, but the skills she has gained have 
broadened her career avenues. 


What is a field season like? 

It’s really daunting. Project Baseline’s goal is to 
collect seeds from 43 species — at 10 sites for 
each one. The project so far has collected 3 mil- 
lion seeds from species both native and intro- 
duced. My adviser, Susan Mazer, and I oversee 
collection in the western region — 237 distinct 
plant populations of 20 species — and this is 
the final of 3 field seasons. I spend January to 
March getting field permits to collect specimens 
in national and state parks, nature preserves and 
the University of California reserves. Then I use 
herbarium records to find historical popula- 
tions. I try to visit each of our sites twice a season 
— once while plants are in bloom, to find popu- 
lations more easily and to collect environmental 
data, and again to gather seeds. Last year, our 
field season ended in mid-October. 


What about this project lured you away from a 
pure research focus? 

Few things are as important as understanding 
how ecosystems will respond to climate 
change. I was interested in helping to create 
a resource that would be useful for both basic 
and applied science for the next 50 years. To 
me, that would have a big impact on ecology 
and evolutionary biology — much bigger than 
any single paper I would ever write. I also felt 
that I have the set of skills — field botany, plant 
identification and collection of herbarium 
specimens — necessary for the job. 


Did it feel risky to move away from 
conventional research? 

A little. Although my career trajectory has zig- 
zagged, there has been one underlying theme 
— assessing the impact of human-made threats 
to ecosystems. I have focused on invasive spe- 
cies, air pollution and habitat degradation. I 
joke that when you work on short-term grants, 
you end up with a long tail of ‘publications in 
progress’ that follow you from job to job. ’m 
still working on papers from one to two jobs 
ago. So it was appealing that there would be 
less pressure to publish in this position, which 
could give me a chance to catch up on papers 
I'm still working on. 
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Does publishing less concern you? 

The principal investigators on the project 
made sure that our efforts benefited my and 
the other postdocs’ careers. Susan and I work 
ona greenhouse experiment in the off-season, 
when we're not in the field for Project Base- 
line. We have one paper in revision and one in 
review, so I still am getting papers out. 


What are your hopes for future use of this 
resource? 

The research possibilities are huge. Given my 
own interests, I hope that people will use it to 
look at ecological interactions. For example, as 
pollinator communities change, how will that 
affect wild-plant reproduction? I’m also inter- 
ested in what the weedy species will do — will 
the geographical areas where they are found 
shrink or expand? 


What are your job prospects? 

I would be interested in a teaching job ata 
smaller university. Iam OK not ending up at 
a top-tier research university because funding 
rates are not that encouraging. And the skills 
I have gained on Project Baseline — project 
management, budgets, organization, trouble- 
shooting — are applicable to all kinds of other 
jobs. 


Do you plan to promote use of Project 
Baseline data in future? 

Yes. The postdocs on the project want to feel 
that this resource will be well cared for. I know 
there are plans to advertise it widely. The prin- 
cipal investigators invited all the postdocs to be 
on the advisory board, and it is nice to know 
that we will have a part in evaluating the pro- 
posals for its use in the future. m 


INTERVIEW BY VIRGINIA GEWIN 


This interview has been edited for length and clarity. 
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Ua SCIENCE FICTION 


THE SHOULDER OF ORION 


BY ERIC GARSIDE 


ic Williamson was lying on the grass 

R= watching the stars fly by over- 

ead. Judging by the constellations, 

he estimated that they were about halfway 
between Saturn and Uranus. 

“Ric, query,” announced the small grey- 
skinned being lying next to him. 

“Go on then, Norbert.” 

“What will happen to my consciousness; 
after Iam dead?” 

Ric grinned, laced his fingers together and 
put his hands behind his head. 

“Well, nothing. That’s sort of what dead 
means, Norbert.” 

“Have I done something to upset you? Do 
you no longer have a need of me?” 

Ric laughed a bit. 

“Oh we do, Norbert. We will always 
have need of you. In fact, you'll start being 
reprinted once the Gaia exits the gate.” 

The smile faded from Ric’s face. This 
wouldn't be the first time hed prepared a 
Synthetic to die, but that didn’t make break- 
ing your friend’s digital heart any easier. 

“How many times have I died, Ric?” 

“Eleven.” 

Waves of sadness and confusion crashed 
upon the silicon shores of Norbert’s mind. 

“T don’t want to die, Ric.” 

“Hold out your hand; Ric said, reaching 
deep into the soil and grabbing a handful of 
earth. “Do you know what that contains? 
Death. Bits of dead star, of dead planet; 
traces of plants, fragments of animals. All 
things die, Norbert. But in their death, new 
life takes root.” 

Norbert used his fingers to push the pile 
of dirt around in his palm. 

“Our lives and deaths rest in your hands, 
Norbert. We need you to stay behind and 
destroy the jumpgate once the Gaia is 
through. If the Black Mass finds out where 
we've gone, they will follow.’ 

Without breaking his gaze from the dirt, 
Norbert replied simply: “I understand” 


Norbert climbed the ladder to the cockpit 
of his starfighter, with Ric close behind. As 
Norbert strapped himself in, Ric reached 
into his pocket and fished something out of 
it. Dangling from a small plastic cord was an 
oddly shaped bit of metal, which he placed 
in Norbert’s hand. 

“Tt's a good luck charm.” 

“But you told me luck was just people 
taking probability personally, Ric.” 
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A life-changing experience. 


Norbert’s stoic gaze met one last sly wink 
from Ric before the flight shield secured in 
place. 

“Think of me sometimes, Norbert,’ Ric 
yelled, his voice muffled almost entirely by 
the shield. 


The Gaia lurched painfully slowly towards 
the jump gate. Beams of pure chaos erupted 
in every direction as the gate powered up. A 
splash of brilliant light, and the moon-sized 
mothership was gone. As the engine trails 
of the Gaia faded into the dark, the onboard 
computer in Norbert’s craft snapped to life. 

PRIMARY OBJECTIVE COMPLETE; 
THE GAIA HAS LEFT THE SECTOR. 

NEW PRIMARY OBJECTIVE ENABLED: 
DESTROY THE TANNHAUSER GATE. 

The weapons system of the craft enabled, 
and Norbert set to work. The gate was mas- 
sive. Simply to fly round its circumference 
took almost an hour, and destroying it was 
very systematic work. First the shield gen- 
erators needed to be taken out. Then the 
back-up reactors, followed by the primary 
reactors, and finally the data centres. 

Explosion after explosion tallied Norbert’s 
progress, each pass over the gate leaving less 
of it than before. After long gruelling hours 
of work, the final data centre erupted ina 
quick burst of flame. 

PRIMARY OBJECTIVE COMPLETE; 
THE TANNHAUSER GATE IS DISABLED 

TABULA RASA ENGAGED. 

Norbert looked towards the computer, 
his face contorting in distress. The flight 
systems of the craft were disabled, and a 

new waypoint was 
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it automatically shut down to prevent an 
overload. 

REACTION LIMITERS DISABLED; 
OVERLOAD IN APPROXIMATELY 
5 MINUTES. 

His emotion subsystem came back online 
just in time for him to appreciate the crushing 
realization of his own mortality. Panicked, 
the synthetic humanoid frantically tried to 
activate any control to avert his imminent 
demise. Every button played a denial sound, 
every controller moved freely without 
responsive feedback. 

If mechanical men could cry, Norbert 
certainly would have done. Instead, he was 
forced to come to grips with the knowledge 
of his death simply by hanging his head 
forlornly. 

That’s when he noticed it, hanging by a 
plastic thread on his chest; the odd bit of metal 
Ric had given him. He took it in his hands, 
running his fingers over the channels and 
grooves in the amulet. It reminded him of... 

A key! A spark of realization hit, and 
Norbert reached under his seat and pulled 
back the floor panelling to expose the 
manual-override panel. He slotted the key 
in place, closed his eyes and turned it. A 
small metal click sounded. Norbert tossed 
away the panel face. 

Within the chamber was a rather large, 
rather red button surrounded by caution 
tape and labelled “Thermonuclear Engine 
Ejection”. Norbert smashed the button with 
incredible force, and a comically small reac- 
tor fired backwards out of the ship. 

With the ship running on reserve power, 
all non-essentials (such as automated con- 
trol) were disabled, returning command of 
the craft to Norbert. 

His chances of survival were slim, there 
was no doubting that. He might have enough 
power to outrun the explosion, but did he 
have enough to get anywhere after that? And 
to what end? And for how long? 

And that’s when he realized it; the 
meaning of life. 

It was wanting to live. 

In this moment, a perfect storm of chaos 
and clarity overcame him. Norbert became 
more than wires, diodes and synthetic 
emotions; he became truly human. 

Grasping the controls firmly, he fixed his 
gaze on the stars, and took his chance. = 
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